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Chapter  1 


INTRODUCTION 


1.1  Background  and  Scope 

The  first  solution  to  the  problem  of  optimally  estimating  the  values 
of  a  set  of  quantities  from  a  large  set  of  data  Is  generally  attributed  to 
Karl  F.  Gauss'  method  of  least  squares  (1).*  Although  A.  M.  Legendre  offered 
an  early  published  version  (2)  In  1806,  Gauss  provided  the  basic  mathematical 
derivation.  Interestingly,  the  method  was  developed  and  applied  to  classical 
problems  of  orbit  determination.  Nearly  170  years  later.  It  Is  now  used  as  a 
fundamental  technique  In  space  vehicle  tracking  and  modern  orbit  determination. 
In  fact,  the  wide  applicability  and  use  of  least  squares  In  all  fields  of 
engineering  Is  testimony  to  the  genius  and  Insight  of  Gauss. 

Although  Intermittent  developments  of  some  Importance  occurred, 
particularly  the  Ideas  of  probabilistic  approaches,  It  was  not  until  after 
the  first  decade  of  the  twentieth  century  that  the  foundations  of  estimation 
theory  were  extended  at  a  level  of  significance  parallel  to  Gauss'.  R.  A. 
Fisher  (3)  Introduced  many  of  the  terms  used  to  characterize  the  performance 
of  estimators;  his  concepts  and  efforts  provided  fertile  ground  for  further 
developments  and  new  approaches  to  estimation  theory.  In  1942,  Norbert 
Wiener,  considered  today  as  one  of  the  world's  leading  mathematical  analysts, 
produced  the  so-called  Wlener-Hopf  Integral  equation.  The  solution  to  this 


*  Parenthesized  numbers  Indicate  references  as  enumerated  in  the 
Reference  section.  When  specific  pages  are  referenced,  they  are  separated 
from  the  reference  number  by  a  comma.  Thus  (5,  10-12)  indicates  reference 
S,  pages  10  through  12. 
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equation  Is  a  weighting  function  which,  when  combined  with  a  linear  measure¬ 
ment,  results  In  an  estimate  of  the  desired  quantity  which  minimizes  the 
error  In  a  mean-square  sense  (more  exactly  the  Wlener-Hopf  Integral  offers 
a  method  for  producing  linear,  minimum  variance  unbiased  estimates). 

Although  Wiener's  work  Is  truly  significant,  the  Integral  equation 
Is  limited  In  Its  practical  application.  With  the  growth  of  statistical 
communication  theory,  Wiener's  technique  received  wide  attention.  Several 
attempts  were  made  to  Improve  and  generalize  the  theory;  however,  none  of 
these  Increased  Its  basic  utility  and  applicability. 

The  development  of  the  digital  computer  provided  a  practical  alterna¬ 
tive  approach  to  extending  the  applicability  of  estimation  theory.  Rather 
than  attempt  analytic  extensions  and  solutions  to  the  Wlener-Hopf  Integral 
equation,  R.  E.  Kalman  and  R.  S.  Bucy  (4)  derived  a  differential  equation 
from  the  Wlener-Hopf  Integral.  The  computational  efficiency  of  the  digital 
computer  made  the  numerical  solution  of  the  differential  equation  practical 
and  resulted  In  a  widely  applicable  algorithm  for  providing  linear,  unbiased, 
minimum- variance  estimates.  Today,  particularly  In  navigation  and  guidance 
applications,  "Kalman  Filtering"  as  It  has  come  to  be  called,  ranks  next  to 
least  squares  In  popularity. 

In  spite  of  the  utility  of  the  Kalman  filtering  algorithm,  the  tech¬ 
nique  suffers  from  a  particularly  severe  problem  known  as  divergence  of  the 
estimate.  It  usually  arises  from  the  fact  that  for  the  state  vector  to  be 
estimated,  the  system  dynamic  model  Is  Incorrect.  Operation  of  the  Kalman 
filtering  algorithm  In  the  presence  of  modeling  errors  produces  estimates 
which  are  essentially  worthless:  the  estimated  state  Is  grossly  In  error. 

Divergence  of  the  estimate,  perhaps  the  greatest  detriment  to  the 
Kalman  filtering  algorithm,  has  received  considerable  attention.  As  a  result, 
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various  techniques  have  been  devised  to  compensate  for  modeling  error.  Such 
techniques  may  be  thought  of  as  falling  loosely  Into  two  major  categories: 
adaptive  and  non-adaptlve  methods.  Non-adaptlve  methods  generally  attempt 
to  Improve  the  estimation  process  by  altering  the  filter  structure  In  an  a 
pfUoKi  and  thus  suboptlmal  manner.  They  also  Include  a  priori  approximations 
to  the  actual  modeling  errors.  Adaptive  methods  attempt  to  Improve  knowledge 
of  the  dynamic  model  or  to  Improve  operation  of  the  filter  during  the  estimation 
process. 

The  basic  objective  of  this  study  is  to  investigate  the  utility  of  a 
variety  of  model  error  compensation  techniques,  both  adaptive  and  non-adaptlve, 
and  to  compare  the  effectiveness  of  these  methods.  In  the  remainder  of  Chapter 
1,  the  Kalman  filter  Is  Introduced  and  Its  properties  discussed.  The  model 
error  problem  Is  Illustrated  by  a  simple  example.  A  brief  literature  survey 
of  model  error  compensation  techniques  Is  also  presented.  In  Chapter  2, 
selected  non-adaptlve  error  compensation  methods  are  presented  and  compared 
analytically.  In  Chapter  3,  various  new  and  previously  developed  adaptive 
methods  are  discussed.  Chapters  4  and  5  Illustrate  the  application  of  the 
various  techniques  to  selected  problems,  and  Chapter  6  concludes  the  study. 

1.2  The  Kalman  Estimator  and  Some  of  Its  Properties 

Many  expositions  of  the  Kalman  filtering  algorithm  exist  in  the 
literature,  offering  a  number  of  unique,  yet  unifying  approaches  to  the 
theory  (5,  195-209).  Thus  our  purpose  here  Is  not  to  give  a  rigorous 
derivation  of  the  Kalman  estimator.  However,  as  a  point  of  departure  and 
for  the  sake  of  consistency,  a  somewhat  heuristic  development  will  be 
presented.  Following  this,  some  of  the  notable  characteristics  of  the 
algorithm  will  be  discussed. 
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Before  proceeding*  an  Important  preliminary  must  be  treated.  For  a 
given  dynamic  system,  the  problem  Is  generally  one  of  observing  or  measuring. 
In  a  stochastic  environment,  some  aspect  of  operation  of  the  system,  and 
then  treating  the  measurement  output  In  a  judicious  manner  In  order  to  extract 
specific  knowledge  of  the  system's  performance.  However,  an  Important 
distinction  must  be  made  concerning  the  measurement  process.  On  the  one 
hand  Is  the  case  where  the  measurement  process  Is  carried  out  continuously 
In  time;  this  Is  often  found  In  analog  computing  applications.  On  the  other 
hand  Is  the  case  where  measurements  are  made  at  discrete  points  In  time  In 
correspondence  with  digital  computing  applications.  While  formulations  of 
the  linear,  unbiased,  minimum  variance  estimator  are  known  for  both  cases, 
widespread  application  of  the  digital  computer  focuses  attention  upon  the 
discrete  formulation.  In  the  subsequent  developments  we  will  be  concerned 
with  this  approach  only. 

Within  this  scope  the  problem  may  be  stated  as  follows:  (1)  Given 
a  dynamical  system  modeled  by  the  linear  difference  equation 


xk-i 


*  ^k,k-l  Wk-1 * 


(1.2.1) 


where 


■  *(tj  Is  an  n-vector  of  random  state  variables,  and  x  -  x(t  ) 
k  k  o  o 

Is  given; 


4>.  .  »  Q(t.t  t.  .)  Is  the  n*n  state  transition  matrix*,  with  4> .  «  I; 

A/A*i  A  A“I  J9J 


*  The  properties  of  the  state  transition  matrix  are  well  known.  Some 
of  these  are  mentioned  later. 
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wk  *  *s  an  m“vector  of  random  Input  disturbances  with  the 
statistics 

EKI  -  °‘  £l  uk  "J  I  -  «*  V  K-i  al-i  I  - 

(6  Is  the  Kronecker  delta;  (  )T  denotes  transpose); 
r.  „  ,  -  r (t.t  t.  J  Is  the  nxm  disturbance  transition  matrix;  and 

A  /  a"  J  /C  A— x 

(2)  Given  the  linear  observation-state  relationship 

y.  M  H.  x.  +  v.  (l.C.PJ 

uk  k  k  k 

where 

=  y(tk)  Is  a  p-vector  of  observations; 

Hk  *  H(tk?  a  p*n  maPP^n9  matrix; 

vk  *  v(tk)  is  a  random  P“ vector  of  observation  noise  with  statistics 

*  (■>*!  -  o. E  \vk  "jl  -**  V  *  K-j  f  ‘  °‘ 

E  { vk  w ^  for  all  *  and  j  *; 

(3)  Find  an  unbiased  estimate,  x^,  based  on  the  set,  of  k  observation 
vectors  through  time 

**  E  I  V  V  ' 

l.e. ,  find  x^  =  E  \x(tk\Vk) }  wlthffj^f  =  xk; 

(4)  Such  that  xk  Is  formed  as  a  linear  combination  of  the  k  observation  vectors 

x.  »  L.  £.  .  +  K.  y.t  (1.2.5) 

k  k  k-1  k  ak* 

*  If  and  are  constant  for  all  k,  the  corresponding  noise  processes 
are  wide-senae  stationary. 
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and  such  that  the  state  error  covariance 


Pt  -  PftJV  5  E  {(xk  -  VrxJt  '  V  } 


(1.2.4) 


at  time,  tk,  based  on  Vk,  Is  minimized.  Note  In  (1.2.3)  that  xJt_i  Is  the 
estimate  of  *Jk-1  based  on  y  ,  and  In  this  sense  &k  Is  based  on  all  of  the 
elements  of  Vk.  In  other  words,  given  (1.2.1)  and  (1.2.2) ,  the  problem  Is 
to  find  Lk  and  Kk  such  that  (1.2.3)  Is  an  unbiased,  minimum  variance  estimate 


of  Xk. 


In  order  to  aid  the  discussion,  some  notatlonal  simplifications  are 


made.  In  general,  the  notation 

4 1*  -  ^4 IV 

means  the  function,  /,  evaluated  at  tk%  based  on  the  knowledge  of  V  .  For 
brevity  we  will  simply  equivalence  this  with  the  notation,  f  .  When  no 
confusion  exists,  the  subscript  *  will  be  eliminated  altogether.  Hence, 

*  *  \.*-i  *  «v  w 

p  *  ^^k>  ^k-1^ 

X  -  X.  «  x(tj 
k  k 

£  *  £k  M  £k\k  *  £(ti J  V 


X  *  X.  »  X 


'k  “  X*|*  ”  *  4-1 


*  ■  4 '  *k\k  -  *  -  * 


X  *  X.  *  X 


k  k |ic-i 


X  -  X 


Q  -  Qk  -  QftJ 
R-Fk-  mk> 

p  •  pk  -  pk\k  -  PW 

p-pk‘  4|*-j  -  "VW  -  «*-/  *  r«*-/ 


(1.2.5) 
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L  =  Lkm  L(tk} 


K  *  K.  -  K(tJ 
k  k 


Proceeding,  the  requirement  which  must  be  satisfied  If  £  Is  unbiased  can  be 
obtained  by  using  (1.2.1),  (1.2.2),  and  (1.2.3)  to  form  the  state  estimate 
error,  x,  and  then  taking  the  expected  value.  Thus, 

*k  m  xk  ~  &k  m  k-l  *  ^k-l  ~  Lk  ^ k-l  “  Kk^k 

■  to.  ,  -f  Dj.  -  L.  £.  ,  -  K.  (H,  to.  ,  +  H.  TW.  t  +  v.) 
k-l  k-l  k  k-l  k  k  k-l  k  k-l  k 

£{5^}  -  xk  -  E{&k)  »  0 


to.  ,  -  L.  E{&.  ,}  -  K .  H .  to.  , 
k-l  k  k-l  k  k  k-l 

[(I  -  KH)  *  -  L]  xk_1  -  0 


(1.2.6) 


where  use  Is  made  of  the  facts  that  •  o  and  E{vk)  =  o.  Satisfaction 

of  equation  (1.2.6)  leads  to  the  requirement  that 

L  -  (I  -  KH)  *  (1.2.7) 

and  thus  (1.2.3)  becomes 


**  - ri  -  w  ♦  Vi  ♦ *** 

-  *  ak-1  +  K(yk  -  H  *  &k^)  (1.2.8) 

By  using  (1.2.1),  (1.2.2),  and  (1.2.8)  the  state  error  and  the  state  estimate 
error  covariance  matrix  can  be  obtained  as  follows: 


4>  *1.  , 

+  r  wu  , 

-  K(H  4»  x.  . 

+  h  r  w.  .  +  v  ) 

k-l 

k-l 

k-l 

k-l 

(I  -  KH) (t  * 

+  r  a.  .)-  K 

k-l 

Vk 

(1.2.9) 

p*  - **>  - E{a  - m,f*  h-i  * r  uk-i'  • 

7®  gi  w  <w  gi  ei  ip  ip 

{*k-\  *  *  h1 k-l  r  )(T  ~  H  K  )  -  n  -  KU)(*  *k-i  +  r  u'k-i,vk  K  ' 

*p  m  tp  TT  71  rn 

k  vjx!  ,  $' '  +  a?  ,  r)(i  -  ir  kt )  +  k  0.  vT  r] 

k  k-l  k-l  k  k 
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Combining  terms,  and  recalling  that  vj)  ■  0 ,  wp  *  0, 

E{wk  w**  *  0  and  E{vk  yI-i}  *  °*  (se«  page  4)  one  obtains 

Pk-  (I  -  KH)  P  ♦**  +  r  Q  VT)(I  -  HT  KT)  +  K  R  KT  (1.2.10) 

In  view  of  the  definition  of  P  In  equation  (i.2.5)t 

P  »  (I  -  KH)  P  (I  -  HT  KT)  +  K  R  KT  (1.2. 11) 

ne  first  variation  Is  now  taken  with  respect  to  K. 

6P  -  (I  -  KH)  P  (-HT  61?)  +  K  R  6?  +  (I  -  6/W;  T  (I  -  HT  K*)  +  6K  R  KT 

(1.2.12) 

Necessary  and  sufficient  conditions  that  P  be  a  minimum  are  that  (1)  6P  -  o 
and  (2)  that  the  second  variation  of  P  be  positive  definite.  Solving  (1.2.12) 
for  K  with  6P  »  0  leads  to 

K  -  P  ^(H  P  HT  +  R r1  (1.2. 1$) 

For  an  arbitrary  n- vector,  a,  the  corresponding  quadratic  form  for  the  second 
variation  of  P  Is  required  to  be 

8T  6*P  e  -  8T  6K(H  P  HT  +  R)  6KT  8  >  0  (1  2  14) 

Thus  If  P  Is  positive  definite,  then  the  bracketed  term  In  (1.2.14)  must  be 
positive  definite,  and  fulfillment  of  ( 1.2.14 )  guarantees  the  existence  of 
the  Inverse  In  (1.2.13). 

With  K  given  by  (1.2.13)  the  updated  state  estimate  error  covariance 
may  be  obtained  by  substituting  for  K  In  equation  (1.2.11). 

P  -  (I  -  KH)  P  (I  -  HT  KT)  +  K  R  KT 

•  P  -  K  H  P  -  P  HT  KT  +  K(H  P  HT  +  R)  KT 

•  V-khT-TiFiP  +  T  HT(HT  V  HT  +  R)~2  (H?  RT  +  R)  KT 

or 


P  -  (I  -  KH)  P 


(1.2.15) 
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which  is  the  required  covariance  update  equation. 

The  system  dynamics  are  given  exactly  by  equation  (1.2. l).  The 
random  Input,  w ,  Is  unknown;  however,  being  probabilistic  in  nature,  the 
expected  value  of  the  equation  may  be  taken  conditioned  upon  the  observations 
up  through  time,  ty.  Thus 

"  *k,k-l  E^xk-l  I  Vk-1  *  +  Tk,k-1  FHc-l  I1*-!*  * 

with 

Eh»k\Vk)  *  Sfo*)  -  o 
or 

«  &tu \Vu  ,)  s  K  u  ,  *u  ,  *  *u  ,  t  *  *  (1.2.16) 

k  k'  A-l  A,A-2  A-i  A-i ,A 

Equation  (1.2.16)  provides  the  estimate  of  x  at  time  tk  based  upon  observations 
through  time  t  .  It  Is  also  an  unbiased,  minimum  variance  estimate  (5,  201). 
and 


Vk  *  p(tk\yk^  *  *  pk _i  *r  *  r  a  rT  c.2.17) 

Is  the  predicted  value  of  the  error  covariance  at  tk  given  observations 

through  time  t  (5,  201).  Equation  (1.2.17)  follows  directly  by  forming 

the  state  estimate  error  covariance  using  equations  (1.2.1)  and  (1.2.16). 

Equations  (1.2.16),  (1.2.17),  (1.2.13),  (1.2.6)  and  (1.2.15)  are  the 

equations  of  the  classical  linear,  unbiased,  minimum  variance  estimate.  The 

algorithm,  denoted  as  Algorithm  I,  Is  summarized  below. 

Given:  the  a  pxiotii  Information  B.  ,  P.,  y u,  and  k  *  1.  Compute: 

0  0  *+ 


1. 

\ 

4. 

*k  "  Xk 

*  Vk  -  Hk  V 

2. 

\  -  1  pk-i  »T  +  r  rT 

5. 

Pk’(I 

■  Kk  Hk>  ^k 

3. 

**  '  \  Vs*  h  *1  *  V'J 

6. 

k  «  k  + 

1 t  repeat 
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The  Kalman  estimator  above  Is  based  on  linear  system  dynamics.  It 
has  been  applied  successfully  to  many  nonlinear  problems  In  the  following 
manner. 

Consider  the  n-element  state  vector,  x(t),  whose  values  at  discrete 
times  are  given  by  the  nonlinear  difference  equation, 

X(tk)  -  nxn^),  tk] .  rk  k_1  n.s.w 

where  w  Is  a  random  Input  with  properties  as  specified  by  (1.2.1)  %  and  r  Is 
the  corresponding  disturbance  transition  matrix.  Define  a  reference  state, 
X*(t)  .  If  F[X(tk_])t  tj)  Is  continuous  for  tk>ti>.  tk_jt  then  a  linear 
approximation  to  F  may  be  obtained  by  expanding  In  a  Taylor  series  about 
to  obtain 


3  F  * 

F(Xk-l *  V  ~  F(Xk-l*tk)  +  3*.  (Xl 


-  X. 


3  V 

~  Xk*  +  (Xk~r  xk-l*} * 


(1.2.19) 


where  the  subscripts  denote  values  at  the  corresponding  times.  Substituting 
Into  ( 1.2.18 )  we  have,  to  first  order  In  x 

k-l 

3 F  * 

x.  -  x*  +  (x.  .  -  x.  *)  +  r  v 

k  k  dX.  .  k-l  k-l  k,k-l  k-l 
k-l 

Defining  x.zx.-  x  *  and  $  =  3 F.  V3*.  t,  then  obtain  (1.2.1). 

j  j  j  a  a"* 


*.  »  ,  ,  x.  ,  +  r.  .  .  w.  , 

k  k,k-l  k-l  k,k-l  k-l 


(1.2. 1) 


A  similar  linearization  Is  performed  If  the  observation-state  relation 


Is  also  nonlinear,  but  with  additive  noise,  v(t  ). 
Y(tk)  -  G[x(tk),  tk ]  +  v(tk) 


(1.2.20) 


t  Note  that  this  is  the  definition  of  $  for  the  linear  case  as  well. 
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where  Y(tk)  Is  a  p-vector  of  observations.  Defining  a  nominal  or  reference 
observation, 

V  *  G[xk*3  tk]  +  V  (1.2.21) 

where  the  subscripts  again  denote  values  at  the  corresponding  time,  and 
expanding  In  a  Taylor  series  we  have 


v  - V' +  • 

k 

or  to  first  order  in  xk, 


.  +  v. 


Hk  xk  +  vk 


(1.2.2::) 


(1.2.2c) 


where 


»*  * r*  - v* 

Thus  the  optimal  linear  estimator  previously  presented  may  be  applied  to  the 
cases  of  nonlinear  dynamics  and  observation  processes.  However,  accurate 
estimates  of  the  deviation,  ,  from  the  reference  or  nominal  solution  require 
that  the  linearity  assumption  be  valid.  It  Is  possible  that  large  deviations 
can  violate  the  linearity  assumption  resulting  In  divergence  of  the  estimate. 
One  method  of  reducing  this  possibility  Is  to  employ  the  so-called  extended 
Kalman  filter  In  which  a  new  reference  Is  chosen  at  each  observation.  In 
particular,  after  each  observation  Is  processed,  the  reference  solution  is 
updated  by  setting  the  nominal  equal  to  the  new  estimate: 

After  forming  x*%  £.  Is  set  to  zero  so  that  £  =  x  »  o.  Then  the  next 

K  A  A  K »  A 

updated  estimate,  at  t  ,  Is  found  as 

£  a*  K  U 
k+1  k+1  “ k+1 


or.  In  terms  of  x , 


4 

The  notation  Xk  Is  no  longer  necessary  since  It  Is  Identical  to  X^.  The 
extended  Kalman  algorithm,  Algorithm  II,  Is  summarized  In  Figure  1.2  for  the 
nonlinear  problem. 

Usually  the  difference  equation  (1.2.1)  Is  obtained  from  the  solution 
to  the  linear  vector  differential  equation 

x(t)  -  A(t)  x(t)  B(t)  u(t)  (1.2.::?) 

where  u(t)  Is  some  Input.  For  the  nonlinear  problem,  A(t)  »  a f(X*»  t)/&*(t )* 
where  x*  satisfies  the  nonlinear  differential  equation,  x*(t)  *  f(x*,t). 

The  solution  to  (1.2.27)  Is  well  known  (39,  41-43)  (15,  31-43)  in  the  form 

x(tj  -  t  x(t .  )  +  [k  *(t.  x)  b(x)  u(t)  d  T  (1.2.2s) 

a  K  A- 1  A- 1  wt  a 

Vi 

The  state  transition  matrix,  $,  satisfies  the  homogeneous  differential  equation 

h(t,  ty  -  Mt)  ty,  t ty  -  1  ( 1.2.29) 

*  also  has  the  following  properties: 

4T t^,  ty  =  t^,  ty  t j 3  ty 

V  *  *rv  v 

When  u(t)  Is  replaced  by  white  noise,  with  ^{uft;  ur(V}  * 
i/(t;  6ft-TJ+,  difficulties  occur  In  attempting  to  evaluate  the  resulting 
stochastic  Integral  In  (1.2.28).  The  problem  arises  from  the  fact  that  the 
elements  of  white  noise  are  uncorrelated  In  time,  and  hence  are  nowhere 
continuous.  A  common  approach  to  remedy  this  (44,  326-327)  (39,  115-117) 

Is  to  select  t,t  =  tk  -  t  small  compared  to  the  system  characteristic 


^5ft  -  t)  is  the  Dirac  delta. 


Pk  [A(t)P(t)  +  P(t)AT(t ) 

tk-l  +  B(t)Q(t)BT(t)  )  dt 


Figure  1.2  Extended  Kalman  Filter:  Algorithm  II 
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response  time,  and  approximate  u( x)  as  a  constant  'J(tk_ij  over  At.* 

In  this  case  we  define 

T(tk>  tk-i)  ~ f  *(tk *  T)  B(x)  dT  *  UTlVso) 

tk-i 

This  Is  the  approach  taken  to  obtain  equation  (1.2.1).  An  alternative  method 
is  to  define,  formally, 
du'*  u(x)  dx 

and  employ  the  definition  of  the  ltd  stochastic  Integral  (5,  98)  to  obtain 

tk 

J  Wt^x)  B(x)  dw  *  \tk.x  Bk  (»k  -  u'k_i)  (1.2.31) 

tk-l 

In  this  case  one  cannot  write  the  form  r,  ,  ,  w,  Another  alternative,  the 

k,k-l  k-1 

Stratonovlch  stochastic  Integral,  yields  the  same  result  for  this  case 
(5,  116-120).  The  most  general  consideration  Is  simply  to  define 

tk 

W(tk-1}  ~  J  *(tk*  T)  B(r)  u(x)  dT  (1.2.32) 

tk-l 


In  this  case,  r.  .  .  *  I.  The  differences  In  these  definitions  are  mani- 

k,k-l 

fested  In  the  estimation  process  through  the  corresponding  covar.ance  terms. 
Thus  for  (1.2.30)  one  finds 
r  e{w  1/}  rT  = 

tk  tk 

f  x)  B(x)  dx  E{u}(_1  /  BT(j)  *T(tk,  x)  dx  (1.2.33) 

Vi  Vi 


For  the  It6  integral, 

r  e{w  wr}  rT 


4>  .  ,  B.  E{(W : 

k,k~l  k  k 


k-l  k 


U'  )}  B 
k-l  5  k 


k,k-l 


Carrying  out  the  multiplication,  this  becomes 


*  A  white  Gaussian  process  may  be  shown  to  be  the  limit  of  a  white 
Gaussian  sequence,  e.g.,  (5,  83-85). 
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mu  ur)  rr  -  *  B  E{w'  w'T  +  w'_z  v£  )  BT  4>T  (1.2.34) 

Since  E^wf^  m  Q1  6 i}  _ _ _ _ _  ___ 

Finally,  for  (1.2.32) 

TE{u  uT}  rT  «  E{u  wT)  - 

fj  *<**•  ■<>  B<'>  B^>  ^  >  bT/°>  *>  ** 

*k-l 

But  S{u(t)  uT(0))  -  u(t)  «Ct  -  p),  and  since  (44  ,  332) 

i1-  K-i  « 1 « 
is,  Vj  > T  >  h 


6(t  -  p)  dp 


' k-1 


then 


E(d  ur}  mf  9(t^  x)  B(x)  U(x)  Bt(  x)  <b(t  ,  x)  dx 


nO  7C  1 

9  U  9  0£V 


*k~l 


Noting  Q  =  E{Wj  uj}  *  £{u  u7},  one  finds  for  ( 1.2.33 ) 
respectively, 

r  q  rT  -  r  u  rT 

r  q  rr  -  2  ♦  b  q'  bt 
** 

r  q  rT  •  q  *  f  $  b  v  bt  $T  dx 


(1,2.34).  and  (1,2.25). 

(1.2,.  3  3a  * 

( 1.2.34  a ' 


Thus,  depending  upon  how  one  depute*  the  stochastic  integral  in  (1.2.28) 
various  forms  may  be  obtained  for  r  Q  rr. 

With  the  extended  Kalman  form,  the  state  transition  matrix  is  used 


only  In  the  prediction  of  the  state  error  covariance  matrix.  It  is  possible 
to  eliminate  the  state  transition  matrix  altogether  by  use  of  a  differential 
equation  for  the  error  covariance,  obtained  by  a  straightforward  limiting 
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process.  First  we  write  (1.2.17)  such  that  the  forcing  term  r$r  Is  In  the 
form  (1.2.35).  Letting  t  -  t^,  t  *  tJt_J ,  and  defining  At  et  -  t,  we  have 

- mr^Tf  — 

( 1.2.M) 

Using  the  Taylor  series  expansion, 

P( t)  »  [$(t,t)  +  i>(tjt)  At  +  ...]  P(t)  [t(tst)  -f  Q(t,t)  At  +  ...]T 


W 

Lf  B(e)  Ufa)  BT(e)  $(1,8)  da  + 

Lt 


X»t 

~  /  $(T ,a)  B(e )  Ufa)  BT(a)  ^(x.s)  da  At  +  . .  .J  (1.2.37) 

where  Q(t,t)  -  d$(xtt)/dx  evaluated  at  x  =  t.  The  first  Integral  In  (1.2.37) 
Is  zero.  Applying  Leibnitz'  rule  to  the  second  Integral  we  have 


T «t 

^  f  Q(xts)  Bfs )  U( 8 )  BT(s)  $T(t ,8)  da  At  * 

£ 

T=t 

B(t)  U(t)  BT(t)  At  +  f  [$<%e;  B(b)  U(b)  BT(b )  *T(x,a)\  da  At 

t 

(1.2.3$) 

Noting  that  the  Integral  here  Is  also  zero,  and  that  i(t,t)  =  A(t),  one 
finds  on  substituting  Into  (1.2.37)  that 


P(x)  -  P(t)  +  Aft)  P(t )  At  +  P(t)  AT(t)  At  +  B(t)  U(t)  BT(t)  At  *  ... 
Now,  by  definition  P(tk_2)  *  p(tk.1 1 »  and  »  P(tk  |  y  ) .  Since  no 
new  Information  Is  being  added,  then  at  t  *  tJt_J  we  may  make  the  substitution 
p(t)  for  rrth  Then  substractlng  T(t)  from  both  sides,  dividing  by  At  and 
taking  the  limit  as  At  -*■  o,  one  obtains  the  differential  equation 

P(t)  *  A(t)  P(t)  +  P(t)  A  (t)  +  B(t)  Q(t)  BT(t) 


(1.2.30'' 
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Since  P  Is  a  symmetric  n*n  matrix,  only  n(n+l)/2  equations  need  to  be 
Integrated.  <t>  Is  nxn  and.  In  general,  not  symmetric,  so  a  reduction  in 
-•mimerfcarl  effirrt'Ts  'ttstyrnetf  by  Tn  teg  rating  the  appropriate  n(n+l)/2 
equations  for  p.  On  the  other  hand,  depending  upon  the  specific  problem  at 
hand,  It  Is  often  possible  to  reduce  the  number  of  equations  In  $  «  d*  by 
obtaining  closed  form  solutions  for  some  of  the  elements.  Also,  it  has  been 
found  that  (1.2.39)  can  be  a  difficult  equation  to  Integrate  numerically. 

1.3  Properties  of  the  State  Error  Covariance 

This  section  deals  with  some  characteristics  of  the  state  error 
covariance  matrix.  We  will  not  elaborate  at  length  —  a  number  of  Investi¬ 
gators  have  done  extensive  work  In  this  area,  notably  Kalman.  However,  some 
of  the  key  properties  are  presented. 

We  first  show  that  any  covariance  matrix  Is  non-negative  definite. 
For  a  vector  random  variable,  zt  with  mean,  u,  and  covariance,  z,  consider 
the  quadratic  form 

q  *  a  Za  (1.3.1) 

where  a  Is  an  arbitrary  vector  of  constants.  Since  the  expected  value  Is  a 
linear  operator, 

q  =  aTE{(z-\i)  (z-\i)T)  a  *  E{aT(z-\i)  (z-\i)Ta)  (1.3.2) 

Defining  the  scalar 

«  T 

8  =  a  (z-\i) 

we  have 

q  -  E{8  8T)  -  Sis2)  (1.3.3) 

which  is  never  negative.  Thus  Z  is  non-negative  definite. 

Next,  positive  definite  bounds  may  be  established  for  the  error 
covariance  associated  with  the  Kalman  filter.  In  particular,  Sorenson  (7) 
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takes  a  rather  unique  approach  by  decomposing  the  system*  equations  (1.2.1) 
and  (1.2.2)  Into  two  separate  parts.  One,  denoted  as  p,  has  Input  noise, 

-but-  frf—fr-  measurements^  the- ether ,  denoted -as~mv-has-  no  1  nput-iwfse;  tnrrd - 

Imperfect  measurements.  Thus, 


Xa  + 
k  k 


m  p 

VkMyk  +  yk 

where  the  systems  m  and  p  are 


m: 


rm 

P 

m 


tor1 


k-1 


«*l  *  •* 


p •• 


x?  *  ftrf  +  r«  . 
k  k-1  k-1 


yPk 


Ha* 

k 


1 1.3.4 ) 


(1.3.3) 


(1.3. 6) 


Sorensen  then  discusses  the  covariance  properties  for  these  two  systems 
separately.  For  the  noise-free  plant,  m,  the  observability  matrix,  defined 
as 


t  t.j-i  "I  "'i1  't 

l=j 


,1-1 


(l. 


n  I 


Is  used  to  establish  the  positive  definite  property  of  the  state  error 
covariance  matrix.  Specifically,  a  matrix  Inversion  lemma,  the  so-called 
Schur  Identity  given  in  Appendix  A  allows  (1.2.15)  to  be  written  as 


K  -  i™*  **>  V-  'V2" +  <  R'k  «*>'J 


(1.3.  8) 


where  now  $r,  since  m  Is  the  noise  free  plant.  Then, 


P"Jm  -  *’IIr  .  Pllm.  ,  +  ht  r1  h 

k  k,k-l  k-1  k,k-l  k  k  k 


(1.3.9) 
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The  system  Is  said  to  be  observable  on  the  Interval  [ t y  tk ]  If  the 
observatlllty  matrix  M.  .,  Is  positive  definite.  If  this  condition  holds, 
.and  P*-ls  also  pos1tive.-def4-n4tev-then  ^.*£.0^can-bewrittenafs  ~  ~~ 


p"1  -  <t i  \p~lm  +  M  ]”J  <t>T 

*  k,0  1  0  k,ll  k,0 

where  use  Is  made  of  the  fact  that 

p  ~lm  =  [$  p"  <j>T  i"J 

k  l*k,k-l  k-1  Jt  ,/c-J J 

a  p~lm  <jf  T 

k,k-l  k-1  k,k-l 

*  ♦  P_im  $r 

k,k-l  k-1  k,k-l 


( 1. 3.  1C) 


By  arguments  similar  to  the  previous  discussion,  Is  therefore  positive 
definite  for  all  k ,  since  the  bracketed  term  In  (1.3.10)  Is  also.  Additionally, 
Sorenson  shows  that  m  may  be  described  In  a  recursive  fashion  by 

cr 


i  ,  j  +  K  *  i  Rl1  K  i  , 
k,j  k-l,j  k,j-l  k  k  k  k,j-l 


(1.3.11) 

and  since  the  second  term  on  the  right  Is  non-negative  definite, 

M  -  M.  -  .Is  also,  thus  allowing  the  conclusion,  by  (1.3.10),  that  p™ 

K,j  K—lf]  K 

Is  positive  definite  for  all  k  >  j. 

Building  on  these  facts,  Sorenson  also  states  the  well  known 
characteristic  that  p£  generally  tends  to  vanish  for  well-defined  systems 
with  no  input  noise.  First,  the  concept  of  g-stage  observability  is  defined. 
Given  l  <  a  <  n  such  that  t,  <  t.  and  t.  <  ,  the  system  m  is  said  to 

be  <y-stage  observable  on  an  Interval  [tQt  tN]  if  and  only  if  Mk  k_q+1  Is 
positive  definite  for  every  t  .  That  this  Implies  m -  m.  Is  positive 

AC  AC  / 1  1 1 

definite  may  be  readily  seen  by  modifying  (1.3.11)  to  obtain 


M  =  M  +  V  M  $ 

k,l  k-q,l  k-q,0  k, k-q+l  *-g,0 


(1.3.  ir> 


Thl 


s  follows  by  substituting  "kfk_q+J  using  (1,3.7).  Under  <?-staqe 


observability,  ( 1.3.12 )  suggests  that  In  some  sense  Mk  _  Increases  without 
“bound.  Consider  the  spectral  norm  of  M  and  w"1: 

II  u"  1  II  /yk 


(1.3. 13) 


HVlII  * 

Je  Jr 

where  X  and  X  .  are  the  maximum  and  minimum  eigenvalues  of  m,  ,  .  Now 
max  min  9  k,l 

if  M.  ,  Is  o-stage  observable, 

Kfl 

k-q 


X*  >  Xk~9  ,  x\  >  X"  , 
max  max  min  min 


with  similar  Inequalities  for  the  other  eigenvalues.  Then 


and 


II  «*,2II  -VZ 


Lim  .  k 
k+»  mjt 


kl  I  K.i  n  -2 


rl 


Lim 


Xk. 

min 


( 1.3.14 ) 


Using  this  result  together  with  (1.3.10),  Sorenson  realizes  the 

rl 


following  conclusion:  For  a  ty-stage  observable  system,  If  ||w”  ,||  converges 

k  ,1 

fa 
k 


to  zero  more  rapidly  than  1 1^  Q  \  |2  Increases,  then  the  error  covariance 


vanishes  as  k  ■*  «.  To  verify  this  we  merely  need  to  show  that  the  norm  of 
the  elements  of  (1.3.10)  vanish.  First,  for  no  a  piioki  Information, 

P~lm  *  0.  Thus*, 


and 


K  n  *1  n>  K  n  I^1”  +  ,  l”-*  „ 

k,0  k,l  k,0  k,0  0  k,l  k,0 


I 


( 1.3.15 ) 


(1.3.16) 


which  vanish  as  k  +  »  If  the  original  hypotheses  are  met.  Further,  as  k  -*■  ®, 
(1.3.15)  tends  to  equality  with  Increasing  accuracy.  The  Implication  Is  that 


*  A  j>  B  >  C  means  that  A  -  B  is  non-negative  definite  and  P  -  /  is 
positive  definite. 


v  t'T-; 


i: 

for  large  k,  Is  essentially  independent  of  P^  for  <?-staqe  observable 
systems. 

For  system  p,  where  the  measurements  are  perfect,  one  Is  able  to 
conclude  that  the  error  covariance  matrix  is  never  positive  definite.  Using 
( 1.2.15 )  and  (1.2.13), 

H.  pf  -  HJI  -  Ku  HA  PP 
k  k  k  k  k  k 

«  {H  -  H  PPHT[H  PpHT]~1  H)  Pp 

m  0  (  1  •  o  •  i  f ) 

Thus  If  p£  were  positive  definite,  Hk  would  have  to  assume  the  contradictory 
state  of  having  to  be  Identically  zero.  Utilizing  the  gains,  A'”  and  Kp 

/C  A 

which  are  optimal  for  each  system,  Sorenson  then  shows  that  a  lower  bound 
on  the  total  error  covariance  Is  given  by 

P.  >  K  +  K 
k  —  k  k 

or 

P.  >  K  Jpnlm  +  ,ri  (1.3.18) 

k  —  k,0  0  k,l  k  ,0  k 

Utilizing  the  gain  A^,  which  Is  suboptlmal  for  the  total  system  but  may  be 
optimal  for  either  (but  not  both)  of  the  systems  m  and  p,  an  upper  bound  Is 
established  as 


P.  <  pj®  +  pj® 
k  —  k  k 

where  p"*  and  p ps  are  the  corresponding  covariances.  If  Is  chosen  to  be 
(the  optimal  gain  for  system  m),  and  used  for  both  systems,  then 

p„  i  **/0I  r’o"  *  \.o  *  1 

But  will  generally  Increase  without  bound  since  noise  Is  continually 
being  Input  to  the  system,  and  therefore,  as  Sorenson  points  out,  ( 1.3.19) 
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Is  an  unsatisfactory  bound.  Note  that  In  this  case  (1.3.1?)  generally  will 
not  hold,  since  the  gain  Is  not  the  optimal  for  system  p.  If  In  fact, 
r^r3,  Is  positive  definite,  then 


-  [i  -  pm ht(h  pm ht  v-  r)~ 1  H]  p 

-  (  i  -  k°  h)  [  *  p^2  v-  r«Jt_JrTi 
»  (  I  -  if  H)  [  Pkm  +  AP  p  +  Tfl  rT] 
>  r«Jfc_Jrr  v-  AP  p  >  r^r* 


(1.3.20) 


This  last  Is  true  since  and  hence  k"1  eventually  vanish.  Tnus  (1.3.18) 
and  (1.3.19)  represent  positive  definite  bounds  upon  the  total  system  error 
covariance,  P.. 

k 

An  alternate  definition  of  observability  requires  that  the  Information 
matrix,  T.  .,  be  positive  definite: 

K>9J 


>  0,  k  >  j 


(1.3.21) 


If  this  condition  holds,  the  system  ( l. 
observable  with  respect  to  (y  ^  y j+1>  • 
to  the  observability  matrix  by 

M  =  $>T  T  4> 

k,j  k,j-l  lk,j  vkfj-l  • 

and  related  to  the  error  covariance  by 


P~lm  a  4?  p~lm  A  +  T 

k  0 ,k  0  O/k  k,l 


2.1,  1.2.2)  is  said  to  be  completely 
. .,  n  ).  Note  that  T  .  is  related 

‘  A  A  /  7 


(1.3. 
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( 1  •  .■> .  2  o 


This  latter  follows  from  (1.3.10).  A  recursion  for  T  may  also  be  obtained 
using  (1.3.11)  and  (1.3.22): 
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T  m  M  i  M  <J>  +  UT  R~ J  // 

k,j  vj-l,k  ”k,j  vj-l  vj-l,k  nk-l,j  vj-l,k  *  k  k  k 

rn  $T  $T  M  <J>  $  +  HT  R~ 1  H 

vk-l ,k  j-l,k-l  k-l,j  j-I/Jt-2  Vi,A  k  k  k 


Tu  J  T.  ,  .  .  .  +  HT.  /TJ  Hu,  k  >  3 

k,j  k-l,k  k-l,j  k-l,k  k  k  k*  ° 


(1.3.24) 


A  concept  dual  to  that  of  observability  Is  controllability.  The 
controllability  matrix  Is  defined  as 


\  =  y*  $  r  q  yt  $T 

k,0  fa  vk,i  1  i,i-l  Vi-J  1  i ,  i  -  2  vk,i 


(1.3.25) 


The  corresponding  dynamic  system  Is  said  to  be  completely  controllable  If, 
and  only  If,  Q  >  0  for  k  >  0. 

Jazwlnskl  (5,  234-243),  drawing  upon  the  work  of  Kalman,  Sorenson, 
and  others,  gives  a  comprehensive  discussion  of  bounds  and  stability  of 
the  filter  equations.  Extending  the  concepts  of  observability  and  control¬ 
lability,  Jazwlnskl  defines  the  system  (1.2.1,  1.2.2)  as  being  uniformly 
completely  observable  If  there  exist  a  positive  Integer,  N,  and  positive 
constants,  a  and  0,  such  that 

0  <  <xT  <  T.  .  <  01 

—  k,k-N  -  p 

for  all  k  >  N.  Similarly  the  system  is  uniformly  completely  controllable  If 

°  *  *  i  i 61 

where  y  and  6  are  positive  constants.  Using  these  definitions,  Jazwlnskl' s 
results  are  summarized  In  the  following: 


Lenina.  If  the  dynamical  system  (1.2.1,  1.2.2)  is  uniformly  com¬ 
pletely  observable  and  uniformly  completely  controllable,  and  If  pQ  _>  o% 
then 
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a)  Pk  Is  uniformly  bounded  from  above, 

b)  Pk  Is  uniformly  bounded  from  below, 

c)  Pk  >  0% 

for  all  k  >_N.  Further,  the  filter  of  Algorithm  I  Is  asymptotically  stable; 
that  Is, 

II tQ)\\  <ox  exp[-oz(tk-  tQ)] 

for  all  >,  tQ  («i  and  02  are  constants). 

In  (5,  244-251)  Jazwlnskl  also  discusses  error  sensitivity.  In  this 
latter  regard,  he  proved  the  following: 

Theorem.  If  P Q  <  pj  ,  <?k  <  .  and  j?k  <  for  all  k,  then 

pk  -  pk  and  pk  -  pkC  for  a11  k' 

The  superscript,  c,  refers  to  the  numerical  values  employed  or  computed  In 
the  filtering  algorithm.  The  non-superscrlpted  quantities  refer  to  the 
actual  or  true  values  of  the  covariances.  Note  that  the  true  values  are 
seldom  known  in  practice.  Hence,  In  light  of  this  theorem  we  can  enjoy  the 
confidence  that  If  conservative  values  are  selected  for  the  covariances,  then 
the  true  error  covariance  Is  bounded  at  any  time  by  the  computed  error 
covariance. 

1.4  The  Problem  of  Modeling  Errors 

Up  to  this  point  In  the  discussion  of  Kalman  filtering  It  has  been 
assumed  that  the  system  dynamics  are  known  to  within  a  degree  of  uncertainty 
represented  by  the  statistics  associated  with  the  state  noise,  w.  If  the 
uncertainties  In  the  model  dynamics  are  purely  random  with  accurately  known 
finite,  bounded  statistics,  and  more  realistically  If  the  uncertainties 
are  small  with  respect  to  the  state  values,  then  generally  the  assumed 
dynamic  modeling  will  yield  good  results.  On  the  other  hand.  If  there  are 
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modeling  errors  which  contain  unbounded  time  correlated  components,  or 
dynamic  biases  as  they  are  usually  called,  the  estimation  results  based 
on  such  erroneous  models  become  worthless.  In  such  cases,  the  system  Is 
not  uniformly  completely  controllable.  In  particular,  the  model  errors 
manifest  themselves  In  the  <l>-matr1x  which  affects  the  predicted  values  of 
the  state,  x,  and  error  covariance,  P.  If  the  filter  employs  no  process 
noise  covariance,  Q ,  or  only  a  constant  Q,  then  as  the  estimation  process, 
proceeds,  the  state  error  covariance  usually*  decreases  and  so,  therefore, 
does  the  gain,  K.  As  the  gain  becomes  smaller,  the  effects  of  the  measure¬ 
ments  In  contributing  information  become  less  and  less.  The  effects  of 
modeling  errors  continue  to  grow,  essentially  no  new  Information  Is  added, 
and  the  state  estimate  diverges  from  its  correct  value.  A  simple  example 
will  be  developed  which  will  serve  to  Illustrate  these  points.  It  is 
derived  from  information  given  in  reference  (9). 

Consider  two  spacecraft  in  orbit  about  the  earth.  Assume  their 
orbits  are  approximately  circular  and  coplanar,  and  that  the  vehicles  are 
undergoing  a  rendezvous  maneuver,  and  hence  are  separated  by  a  distance  of 
only  a  few  kilometers.  Referring  to  Figure  1.3,  let  rk  be  the  geocentric 
position  vector  of  the  target  vehicle  and  rh  be  the  geocentric  position 
vector  of  the  homing  vehicle.  Neglecting  n-body  and  aspherlcal  gravitational 
effects,  the  equations  of  motion  can  be  written  as 


•• 


*  g(rh)  = 


(1.4. 1) 


*  Recall  that  ||4>  J  | 2  In  (1.3.16)  must  not  increase  faster  than 
|  In#.1.!  I  decreases.  A  similar  characteristic  is  required  in  the  system 

Kfl 

with  process  noise. 


where  (')  J(  )A1t*  p  *  gravitational  parameter  of  the  earth,  and  |r|  1r, 
the  magnitude  of  r. 


8 


Figure  1.3  Vehicle  Position  Vectors 


Defining  the  range  vector, 

p  *  r  -  r.  (2. 4.  C) 

t  n 

and  noting  that  |p|  «  | |  or  |r  | ,  one  can  expand  about  the  target  vehicle 
position  vector  In  a  Taylor  series  to  obtain 

9 g(r  ) 

g(vh)  =  g(rt)  +  -557—  -  V  *  e  (U4.S) 

Neglecting  e,  which  represents  terms  of  second  and  higher  order  In  p,  and 
uslnq  (1.4. 1)  and  (l.> l.v.)  leads  to  the  following  relation: 

4>j(r  ) 

••  t  1  1  •  1 

p=-—v 

Note  that  neglecting  e,  as  well  as  the  higher  order  gravitational  effects 
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has  Introduced  errors  In  the  dynamic  model.  However,  for  the  time  being 
assume  that  (1.4.4)  represents  the  true  model,  that  the  orbits  are  coplanar, 
and  that  the  target  veMcle  orbit  Is  circular  with  the  homing  vehicle  orbit 
nearly  so.  Further,  for  purposes  of  developing  a  simple  navigation  scheme 
the  possibly  questionable  assumption  will  be  made  that  the  homing  vehicle 
Is  closing  at  a  constant  rate,  p,  with  respect  to  the  target  vehicle.  Note 
that  p  Is  the  magnitude  of  the  time  rate  of  change  of  the  range  vector,  p. 


l.e.. 


where  p  «  dp/dt.  Throughout  the  rendezvous  maneuver.  It  Is  necessary  to  have 
knowledge  of  p  and  p;  however,  for  simplicity  In  the  Illustration,  only  p  will 
be  considered. 

To  obtain  Information  about  p  a  Doppler  radar  system  Is  used  to  measure 
the  closure  rate.  Associated  with  this  range-rate  radar  are  uncertainties 
assumed  to  manifest  themselves  as  bounded,  purely  random  errors,  yfc,  in  the 
range-rate  measurements.  Assume  the  vR  have  the  statistics 

Pfy^}  *  0t  *  A  A  =  constant 

Making  the  following  Identifications  with  equations  (1.2.1)  and  (1.2.2), 

PQ  «  P(0)t  xk  =  P;r»  Hk  -  ykx^k  +  vk 
and  assuming  no  state  noise  (q  *  o),  then  the  Kalman  estimation  equations  for 
Algorithm  I  are 

a 

”  ^k-l 

P  -  P.  . 
k  k-1 


-j 


-  u  -  V  pk 


(1.4.6) 
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Recall  that  y  Is  the  measured  range-rate  at  t  ,  and  that  />  =  .7  -  p. . 

A  A  A  A  A 

Explicit  expressions  may  be  obtained  for  the  above  quantities  in  terms  of 

A 

P0»  PQ%  and  yk.  In  particular,  the  recurrence  for  Pk  leads  to 

v 


(mPQ  +  R) 


(1.4.7) 


and 


K 


m  (mP0  +  R) 


(1.4.8) 


for  m  observations.  Then  for  p  , 

tn 

m  v. 


p  *  p 

yO  M 


k=l 


From  equations  (i.4.7)-(l.4.9) ,  It  follows  that  as  m  gets  larger,  the  estimate 

A 

for  p  becomes  Insensitive  to  the  observation  residual  since  the  variance  and 

m 

hence  the  gain  are  becoming  smaller.  The  implication  Is  that  eventually  a 
point  Is  reached  where  K  Is  so  small  that  taking  further  observations  adds 
essentially  no  new  information  about  p.  This  is  an  acceptable  (and  desirable) 
situation  if  no  modeling  errors  are  present,  that  is  If  the  assumption  about 
a  constant  closure  rate  is  valid. 

To  Investigate  this  assumption  further,  consider  equation  (1.4.4)  in 
terms  of  the  relative  derivatives  of  p. 


p  =  p  +  2  in  x  p  +  i  x  p  f  u  x  fu  x  pj  (1.4.10) 

O 

where  w  Is  the  angular  velocity  of  the  line-of-slght,  p,  and  (  )  is  the  time 
derivative  relative  to  the  target  vehicle.  Using  equation  (1.4.4)  one  can 


find 


Zg(r.)  0 

p  =  — ^ - p  -  2d)xp.  Jixp.  cox  (u  x  p) 


From  the  first  of  (1.4.1), 


(1.4.11) 
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3  g(rt) 


r  vT) 
t  t 


(1.4.12) 


So, 


P  * 


and 


T 

r  i* 
t  t 


J  P 


t  i .  •(  ■ 


T 

rt  rt 

kt|2 


j  p  -  2  U  X  p  -  u  X  p  -  ID  X  (id  X  pi  (1.4.1!) 


If  the  rendezvous  maneuver  Is  restricted  such  that  p  maintains  a  constant 
angle  with  respect  to  rfc,  then  w  Is  constant.  Further,  because  the  target 
vehicle  Is  In  circular  orbit. 


where  k  Is  a  unit  vector  perpendicular  to  the  orbit  plane.  In  view  of  this 
restriction, 

T 

p  r 

p  - ^ —  (1-3  — — -  J  p  -  2  a  x  p  .  id  x  (id  x  pj  ( 1.4.16 > 

ktr  i*g2 

Now,  take  the  scalar  product  of  (1.4.16)  with  a  unit  vector  In  the  p-directlon 
to  get 


£ 


P  = 


V(I 


T 

rt  rt 

3  ;  p 


i',r 


T 


oj  x  p 


T 

£_ 

P 


CO  X  (Ui  X  v) 


3\i(pT  r  )2 

» - u —  p  + - - — 

lrt  1 3  Plrtl5 

since  pT (u  xpj  •  p  •  (id  x  pj  s  id  > 
(u  •  p)z  -  <*)2  p2  »  -w2  p2.  But  w2  • 


4-  W2  p 


(1.4.1?) 


o  T 

(p  x  p)  =  o,  and  p  [id  x  (id  x  pi]  r 
y/|rt|*  ;  hence, 
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•• 


P 


... 

p|rtl§ 


(\ PJ 


V2 


IS) 


Next, 


ein  ((at  +  t) 

008 

m 

(at 

p  *  p 

-cos  (tat  +  e) 

.  *t  -  ktl 

sin 

(at 

0 

.  0 

J 

where  e  Is  a  small,  constant  angle  representing  the  deviation  of  r  from 
the  local  horizontal  at  the  target  vehicle  (If  e  =  0,  p  and  rfc  are  perpen¬ 
dicular).  Then,  since  pT  r  Is  the  scalar  product  of  p  and  r  , 

t  C 


pT  *  p|rt|  [ein  ((at  +  c)  00 a  uj t  -  00a  (tat  -f  t)  sin  wt] 
*  p|rfc|  [ein  (a t  cos  (at  cos  e  +  ooa2  (at  sin  e  - 
cob  u it  ein  (at  cos  e  +  ein2  (at  sin  e] 

=  p|rfc|  ein  e 


(1.4.19) 


Finally,  through  first  order  terms,  and  for  closure  along  the  line  of  sight 
at  a  constant  angle,  e,  from  the  local  horizontal, 


or 


p  ein2  e 


P  -  Y2  P  *  O3  Y2  **  — sin2  e,  a  constant  (1.4.20) 

ktl* 

Thus,  only  In  the  special  case  where  e  »  0  is  the  assumption  of  a  constant 
closure  rate  valid  (p  *  0  to  first  order).  The  solution  to  (1.4.20)  Is 
found  to  be 


p  »  qq  cosh  yt  +  8inh  yt 


(1.4.21) 


So, 


p  =  pQ  y  sink  yt  +  pQ  cosh  yt 


( 1.4.22 ' 
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and  It  Is  seen  that  there  Is  an  unbounded  growth  in  p.  A  non-dimensional 
form  of  (1.4.22)  is  plotted  in  Figure  1.4. 

Returning  to  the  estimation  problem,  using  the  original  model 
(constant  p),  the  observation-state  relation  becomes 

yk  =  Pjt  +  vk  =  po  Y  8inh  ytk  +  po  coah  ytk  +  vk 

and 

yk  ■  p*  *  °0  y  einh  ytk  *  'po  °°sh  ytk  'K  +  Vk  u.i. ti> 

So  the  estimate  becomes 

A  mm 

K-K*  (k  V  s/pj  l9o  y  sinh  ytk  +  fo  °0Bh  ytk  -  K  *  V 

<M>  +  R/P0  J  .  vk  , 

~  k  +  R/P0  Pk  (k  +  R/Pq) 

k  +  R/P0  (po  Y  8inh  ytk  *  pO  oosh  YV  (1.4.24) 

Forming  the  error,  p^,  by  subtracting  (1.4.24)  from  (1.4.22)  gives 

-  \(k-l)  +  R/p]  .  “  v 

Qk  -  [  trw0  j  (Qo  y  Hnh  ytk  *  ooah  ytk  ■  °k>  -  (k  *  s/P0) 

(1.4.25) 

From  d.4.24)  one  can  see  that  as  k  increases,  the  additive  effects  of  the 

A 

last  term  become  less  (since  y  «  l) ,  and  pk  tends  to  a  relatively  constant 
value.  In  (1.4.25),  the  term  in  square  brackets  tends  to  2,  and  since  p^ 
is  approximately  constant  for  large  k,  the  hyperbolic  trigonometric  terms 
eventually  dominate,  driving  the  error  to  intolerably  large  values.  As 
before,  the  computed  variance  given  by  (hi.?)  decreases  to  zero. 

Consideration  of  state  noise  to  represent  dynamic  model  uncertainties 
has  been  shown  to  have  the  effect  of  keeping  the  error  covariance  from 
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vanishing,  and  likewise,  the  gain.  For  the  rendezvous  example,  let  r  in 

equation  (1.2.1)  be  the  identity  matrix.  Further,  let  the  process  generating 

wk  be  stationary  so  that  the  variance,  Q  =  is  a  positive  constant.  Then, 

P,  *  P.  .  +  Q  1 1  r’ 1 

k  k-1 

For  k  -  lt 


—  _  Pa  +  Q 

Kx  =  Pi/[Pi  +  R]  -  77 TTTTr 


Assuming  the  initial  estimate  error  is  zero,  the  actual  a  pxioii  variance, 
p o,  is  simply  H.  Then  the  simplification  may  be  made  that 


K  l 


P  +  k  l  £  f.  f  ,  ‘1 

2R  +  Q  2  +  f  1  J  ~  R 


and 


For  k  =  2S 

?!  * * [jt# «  * " [rH * f]  - "  ['  V/f 


P2 


*2  = 


r  LL±JL±  f2)/(z  +  f)  =  i  +  ±  t'2 

P2  +  R  (1  +  3f  +  f2)/(S  +  f)  +  1  3  +  4f  +  f 


Pi 


1  +  Af.  +.J2 1  H 

3  +  4f  +  f  . 


For  k  -  3S 


•[ 

-t 


(1  +  3f+  f) 

(3  +  4f  +  p)  +  1 


1  +  6f  +  5f2  +  f 3 
3  +  4f  +  P 


■]* 

■]* 


*3 


Pi 


,r _ i* 

1_  1  +  6f  + 

-[» 


Of  +  5f2  +  ,f3 


5p  +  p  +  3  +  4f  +  P 


Of  ±  Sf2  +  f3 


io f  +  ep  p 


]* 


1  +  Of  +  5f2  +  ,f3 
4  +  10  f  +  6p  +  f3 


3b 


For  k  ~  4, 


r  1  +  10  f  +  15  f 2  +  7f\  +  f*  ] 

1.5  V-  20f  +  21  f 2  +  8p  +  f"  J 


Pi,  *  ^4  i? 


Finally,  for  fc  =  5, 


P5 


^  5 


\l  +  15f  +  35  f 2  +  28f3  +  OJ*  +  /51 

L  5  +  20f  +  21 f2  +  8f 3  +  f*  J 

r  1  +  15  f  +  25  fz  +  28  f 3  +  of*  +  f5  J 

[FT 35 f  +  56 f*  +  36 f*  +  10 f"  if*} 


P5  =  Kt  R 


Examining  these  values  leads,  by  induction,  to  the  relationship 


where  (?) '  Tjfrnrr 

the  binomial  coefficients,  and  f  =  Q/R.  P  and  P  are  given  as 

J  m  m  3 

It  Is  important  to  investigate  the  limit  of  Kk  as  k  ■*  » 
to  the  recursive  relations 

**  =  =-*- 
Pk  +  K 


P*'« 


then 


\ 


(1.4.27) 


before. 

Returning 
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K-H 

L  k  +  r\ 


Pk  +  Q 


(1.4.28) 


Pu  +  R 
k 


Now,  as  k 


- 

Poo  =  = - 

Pm  +  * 


(1.4.29) 


where  the  notation  P„  means 

00  «-»“>  k 


Solving  for  P  , 


p  Q  ±  V  Q2  +  4RQ 
•  "  2 

where  the  +  sign  Is  taken  since  It  Is  required  that  p  >  o.  However, 


(1.4.  SC) 


”  =  (Q  +  /  Q2  +  4QR  )/2 

P  +  R  (Q  +  /  Q2  +  4QR  )/2  +  R 


1  +  V  1  +  4/ f 
1  +  ✓  1  7  4/ f  +  2/f 


( 1.4.81 ) 


and  therefore, 


t  Vi-1  /> 

l  1  +  /  1  +  4/f  f  2/f  J 


(1.4.82^ 


Noting  that  K  <  1,  and  P  <  P,  one  may  write 


K»<Kk<  7 

P  <  P,  <  R 
00  k 


(1.4.88) 
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The  fact  that  P  and  K  do  not  vanish  due  to  the  presence  of  Q  does 
not  in  qeneral  prevent  divergence  of  the  estimate,  when  unbounded  model 
errors  are  present,  but  simply  delays  it.  Forming  the  estimate  for  very 
large  k  ,  one  obtains  approximately, 

a  ™ 

P*  -  V  -  *»]  Pk  +  A^fPoY  trinh  Y tR  +  p0  <*.’<»//  Y ‘  k  +  i’^I, 


r — —j/L— — i  • 

|_I  *  Jl  *  4/f  t  2//J 

r  1  rptY H 

l_7  +  fmjf  *■  2/f  j 


sink  y t.  +  Po  aoeh  y t  +  t»  ) 

K  K  K 


( 1.4.34 ) 


The  corresponding  error  is  approximately 

p  ~  - - -  fp0y  sink  y t  +  p 

K  1 1  +  Vl  +  4/f f  2/f  ]  K 


Po  cosh  y tk  -  p^; 


[1  +  Vl  +  4/f  ] 

1  +  Vl  +  4/f  +  2/f  J 


(1.4.  M) 


Thus  the  non-zero  steady-state  gain,  K^,  continues  to  provide  information  to 
improve  the  estimate  of  p.  Note  that  as  Q ,  and  hence  /,  becomes  very  large, 
the  error  asymptotically  approaches  the  error  due  to  the  observation  noise. 
However,  after  a  sufficient  amount  of  time,  the  estimate  still  diverges  owing 
to  the  unboundedness  of  the  model  error. 

The  question  remains:  "How  does  one  compensate  for  modeling  errors  so 
as  to  allow  a  workable  estimation  algorithm  which  provides  accurate  and  useful 
estimates?" 


1.5  Literature  Survey 

The  effects  of  dynamic  model  errors  have  been  examined  by  a  number  of 
investigators,  among  them  Heffes  (10),  Schlee,  ct  aC.  (11),  Price  (12), 
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Huddle  and  Wismer  (13),  and  Neal  (14).  Most  of  these  Investigations  are 
concerned  primarily  with  performance  degradation  of  the  estimation  process 
(which  Is  the  Kalman  sequential  estimator  in  all  cases),  and  offer  equations 
which  provide  a  measure  of  the  degradation.  These  equations  are  generally 
based  upon  the  error  covariance  and  are  structured  in  such  a  way  that  model 
errors  may  be  seen  through  their  effects  on  the  error  covariance. 

As  mentioned  In  Section  1.1,  model  error  compensation  techniques  may 
be  classed  loosely  as  adaptive  or  non-adaptlve.  Techniques  from  both  camps 
may  be  further  classed  according  to  their  basic  approach  to  the  problem.  On 
the  one  hand  are  techniques  which  simply  alter  the  actual  estimation  or  filter¬ 
ing  equations.  These  approaches  generally  attempt  to  maintain  the  error 
covariance,  and  hence  the  gain,  at  a  level  which  will  continue  to  provide 
corrective  Information  for  the  state  estimate.  On  the  other  hand  are  found 
methods  which  attempt  to  improve  knowledge  of  the  dynamic  model.  The  former 
approaches  usually  aim  at  computational  simplicity,  but  pay  the  price  by 
compromising  optimality.  The  latter  methods  are  usually  more  in  keeping  with 
the  optimality  properties  but  generally  are  computationally  more  demanding. 

The  technique  of  representing  modeling  errors  as  white  noise  Is  an 
easily  implemented  approach  which  is  perhaps  the  most  conservative.  Numeri¬ 
cally,  it  amounts  to  merely  increasing  Pk  by  adding  the  Qk_1  matrix  to  4P  $T 
As  pointed  out  in  the  last  section,  this  merely  delays  divergence  if  the 
errors  are  unbounded,  but  can  be  effective  for  small,  bounded  errors,  i.e., 
if  the  system  is  uniformly  completely  controllable.  The  consideration  of 
state  noise  is  employed  so  often  that  it  has  become  part  of  the  standard 
linear,  unbiased,  minimum  variance  estimator  (5,  194-209),  (15).  However,  in 
the  face  of  incomplete  knowledge  of  the  dynamic  model,  Qk  must  be  guessed. 
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Rather  than  arbitrarily  select  a  complete  matrix  of  values  to  be 
added  to  the  error  covariance,  an  effective  and  simple  method  is  simply  to 
scale  the  error  covariance  by  a  scalar.  This  has  been  done  recently  by  Tarn 
and  Zaborszky  (16).  A  scalar,  N  /,  is  selected  and  used  to  scale  !'  after 

K 

each  covariance  update.  In  the  application  to  an  inertially  navigated  glide 
vehicle,  it  was  concluded  that  /  •  n  <  7.5,  and  that  satisfactory  results 
were  obtained  for  s  =  l.n. 

Fagln  (17)  introduced  the  concept  of  exponential  age-welghtinq  of  the 
observations.  The  effect  is  to  downgrade  the  value  of  old  observations  so 
that  the  most  current  information  dominates  the  estimate.  Fagin's  development 
is  based  upon  a  recursive  least  squares  derivation,  where 

J k  =  QT(Os  k)  P~°  $( o ,  k)  + 

k  _ 

%  E  [yi  -  II i  *(i>  k)  xkf  p i  [yi  -  //j  *U,  k)  X  ]  (l.b.l) 

1^1 

is  minimized  with  respect  to  .r^.  In  the  standard  least  squares  approach, 

R’1  =  R ~1.  However,  Fagin  modifies  h.  as 

R ^  =  exp  [(tk  -  tj/i]  R^  i  >_  k  i  1. 

where  t  is  an  arbitrarily  chosen  time  constant.  The  resulting  estimation 
equations  have  a  form  different  from  those  of  Tarn  and  Zaborszky;  however, 
they  are  equivalent  (this  will  be  shown  in  Chapter  2  along  with  certain 
requirements  on  the  scaling  factor). 

Miller  (18)  examines  the  behavior  of  the  Kalman  filter  for  continuous 
and  discrete  time  invariant  systems  with  exponentially  age-weighted  observa¬ 
tions.  He  derives  equations  for  the  filter  eigenvalues  to  provide  useful 
guides  for  choosing  the  aging  time  constant,  x. 
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Schmidt  (19),  (20)  offers  two  unique  methods  for  overcoming  model 
errors.  One  utilizes  a  scalar  parameter  />  to  scale  the  optimum  gain,  a,  with 
an  attendant  increase  in  the  updated  covariance  matrix.  A  second  technique 
adds  a  judiciously  selected  term  to  the  Kalman  gain  with  the  corresponding 
effect  of  an  additive  term  to  the  updated  covariance.  In  both  methods  the 
values  of  the  parameters  involved  are  related  to  certain  configurations  of  the 
Kairnan  estimation  equations.  Both  techniques  effectively  over-weight  the  more 
recent  data. 

Limited  memory  filtering  is  a  useful  technique  for  counteracting  model 
errors.  Here,  only  a  limited  batch  of  observations  are  employed.  However, 
the  "batch"  is  updated  in  the  sense  that  old  observations  are  effectively 
discarded  as  new  observations  are  added.  Hence  the  dynamic  model  is  required 
to  be  commensurate  with  the  data  only  over  a  short  time  interval.  While  a 
number  of  limited  memory  filters  have  been  developed,  one  of  the  most  efficient 
and  easiest  to  Implement  is  due  to  Jazwinski  (21). 

Adaptive  techniques  form  the  majority  of  model  error  compensation 
methods.  Here  attempts  are  made  to  estimate  a  parameter  vector,  a,  whose 
elements  are  unknowns  in  the  model.  These  may  include  elements  in  <t>,  H,  Q, 
or  R.  Mehra  (22)  places  adaptive  methods  into  four  categories. 

1.  Bayesian 

2.  Maximum  likelihood 

3.  Correlation 

4.  Covariance  matching 

Bayesian  methods  involve  determining  the  a  poitoiioni  probability 
density  function,  p(xk>  a \Vk).  Employing  Bayes'  Theorem, 
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p(V.\ a)  p(a ) 

P(xk. ,  a\Vk )  =  p(xk\a,  Vk)  p(a\Vk)  =  pfxja,  VJ  - - -  (1.5,*) 

Since 

p(Vk)  *  f  p(Vk\a)  p(a)  da, 

then 


or 

xk  =  J  &k( a. )  p(a\Vk)  da  (I.*.?) 

A 

Using  the  quotient  in  (1.5.4)  substituted  for  p(a\V  )  completes  the  formu- 

/w 

lation  of  the  algorithm. 
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The  obvious  difficulties  In  (1.5.4)  are  the  evaluation  of  the 
Integrals  over  A.  Magi'll  (25)  develops  the  above  algorithm  approximating 
the  Integrals  by  suimatlons  of  discrete  probability  distribution  functions. 
Thus, 

**  •  i:  w  p(°iK> 

i-1 

and 


V  r±iK-ij 

t,  M'kK-i •  V  WajlVi' 

J-2 


T 


where  (1.5.5)  has  been  employed.  This  algorithm  can  be  effective  if  the 
dimension  of  a  Is  not  too  large.  To  implement  the  procedure,  the  values  of 
the  ai  must  be  precomputed;  therefore,  large  n  increases  the  computational 
load.  Further,  the  appropriate  probability  density  and  distribution  functions 
must  be  assumed. 

Maximum  likelihood  estimation  is  based  upon  maximizing  a  likelihood 
function,  L,  formed  as 
L  -  In  p[xk ,  al^J 

with  respect  to  the  state,  x,  and  the  parameter  vector,  a.  Taking  partial 
derivatives  with  respect  to  the  appropriate  elements  produces  a  set  of 
usually  nonlinear  algebraic  equations.  Thus  the  disadvantage  here  is  that 
an  Iterative  method  Is  required  to  solve  for  the  estimates,  or  else  some 
approximation  must  be  made  which  yields  suboptlmal  estimates.  Further,  the 
density  function  must  be  known,  a  pfiiofu..  Alternately,  using  the  marginal 
density  function,  p(a\Vk),  It  is  sometimes  easier  to  derive  an  estimator  for  ,7. 
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The  basic  approach  of  correlation  methods  Is  to  relate  the  auto¬ 
correlation  function  of  the  observations  to  the  unknown  parameters.  These 
methods  are  normally  applicable  only  to  constant  coefficient  systems  and 
therefore  will  not  be  discussed  further.  The  Interested  reader  should  see 


(29)  and  (30). 


Covariance  matching  techniques  attempt  to  make  the  observed  residuals 
match  their  theoretical  covariances  through  the  appropriate  choice  of  the 
unknown  parameters.  Approximating  the  theoretical  covariance  of  the  residuals 
by  the  sample  covariance,  one  has 


v  - 


V 

m 


m 


i 


) 


where  /  Is  the  theoretical  covariance,  v  is  the  sample  variance,  and  y  is  the 
vector  of  observation  residuals.  The  limit,  n,  is  chosen  to  provide  a  sem¬ 
blance  of  smoothing.  Equating  v  and  7, 

V  -  Hk  4>r  +  TQk  rT]  l?k  +  R  =  Vk  (1.5.  .v> 

Care  must  be  taken  to  insure  rank  is  consistent  with  the  number  of  unknowns. 
For  example,  if  Q  is  being  estimated,  HY  must  be  of  equal  or  greater  rank 
than  Q  in  order  to  obtain  a  unique  solution  for  the  elements  of  Alter¬ 
natively,  additional  equations  may  be  obtained  for  different  times.  For 
elements  of,  say,  <t>,  (1.5.10)  is  nonlinear,  hence  an  iterative  method  is 
generally  required.  In  passing,  it  is  mentioned  that  covariance  matching 
techniques  appear  to  give  fair  results  (24),  and  seem  to  be  easiest  to 
implement. 

The  adaptive  techniques  discussed  above  have  been  used  extensively  to 
estimate  the  state  noise  covariance  matrix,  as  well  as  other  parameters.  We 
have  already  mentioned  Magi  1 1  * s  use  of  the  Bayesian  approach  to  develop  an 
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adaptive  algorithm  for  estimating  parameters  which  are  allowed  to  take  on 
only  a  finite  number  of  values. 

Maximum  likelihood  estimates  of  the  state  noise  covariance  are 
obtained  by  Abramson  (26),  Sage  and  Husa  (27),  and  Levy  (28)  to  name  a  few. 
Abramson  provides  a  complete  and  detailed  development  of  optimal  and  sub- 
optimal  methods  for  simultaneous  estimation  of  the  state  and  of  the  noise 
statistics.  While  his  approach  offers  estimates  of  the  diagonal  elements  of 
Q ,  Sage  and  Husa  (27)  have  extended  the  approach  to  yield  estimates  of  all 
elements  of  Q,  although  proof  of  convergence  is  not  established.  Using  the 
Sage-Husa  algorithm.  Levy  has  constructed  a  reprocessing  filter  which  period¬ 
ically  reprocesses  the  accumulated  data  to  obtain  increasingly  improved 
estimates  of  Q  (and  r)  .  In  the  same  work.  Levy  points  out  some  shortcomings 
of  the  Sage-Husa  approach,  and  subsequently  produces  a  corrected  iterative 
algorithm  for  simultaneous  estimation  of  Q  and  r. 

Mehra  (29)  uses  the  innovations  correlation  approach  to  provide 
estimates  of  Q  and  r  for  time  invariant  systems.  The  method  is  limited  to 
cases  for  which  the  number  of  unknown  elements  of  Q  is  less  than  >;  x  p, 
where  n  is  the  state  vector  dimension,  and  p  is  the  observation  vector 
dimension.  In  cases  where  this  restriction  is  violated,  the  Kalman  gain 
may  be  estimated  directly,  although  this  alternative  utilizes  an  iterative 
approach.  In  (30)  Mehra  applies  the  observations  correlation  approach  to  the 
identification  of  time  invariant  system  parameters. 

Jazwinski  (31)  uses  a  maximum  likelihood  approach  to  develop  a  state 
noise  covariance  estimator.  The  joint  probability  density  function  of  m 
residuals  is  maximized  with  respect  to  Q.  For  the  case  of  one  residual, 
normally  distributed,  the  result  is  identical  with  that  for  covariance  match¬ 
ing.  One  residual  is  not  a  meaningful  statistical  sample;  however,  using  the 
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sample  variance  of  a  larger  residual  sample,  Jazwinski  has  obtained 
satisfactory  results  (24). 

An  effective  approach  to  adaptively  compensating  for  modeling  errors 
Is  to  consider  the  state  noise  as  being  correlated  in  time,  and  modeled  by 
the  first  order  Langevin  differential  equation, 

u(t)  =  Cu(t)  +  u(t)  (1.5.12) 

where  C  is  a  diagonal  matrix  of  constants,  and  u(t)  is  a  white  noise  process 
with  statistics, 

E[u(t)]  =  0 ,  E[u(t)  uT(s)]  -  U(t)  &(t  -  8) 

By  extending  the  state  vector,  x,  to  include  the  elements  of  u,  the  problem 
resumes  its  original  form  with  the  only  difference  being  a  larger  state 
vector.  The  specification  of  c  is  generally  not  an  easy  matter;  however,  by 
also  including  its  elements  in  the  state  vector,  the  value  of  c  may  be 
adaptively  estimated.  This  approach  has  been  used  successfully  by  Ingram  (32) 
in  representing  the  effects  of  time  correlated  random  accelerations  acting 
on  the  Apollo  spacecraft.  Schutz  (33)  has  used  the  same  approach  to  account 
for  the  effect  of  mascons  in  the  lunar  gravitational  field.  Tapley  and 
Hagar  (34)  have  successfully  utilized  equation  (1.5. U)  as  well  as  the  second 
order  equation, 

u(t)  =  cu(t)  +  u(t)  (i.f.  :r 1 

to  represent  time  correlated  uncertainties  in  the  thrust  acceleration  vector 
of  a  continuous  low  thrust,  solar-electric  spacecraft.  The  possibility  of 
utilizing  higher  order  models  to  represent  time  correlated  noise  is  also 
Indicated  in  (34). 

A  particular  disadvantage  of  each  of  these  techniques  is  that  unless 
the  constant  matrix,  c ,  has  some  state  noise  variance  associated  with  it, 
it  will  settle  to  some  constant  value  since  its  associated  gain  will  vanish. 
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As  a  result,  unless  the  correlated  noise  represents  the  model  errors  exactly , 
divergence  of  the  estimate  will  eventually  occur  (35). 

It  Is  obvious  that  there  Is  great  latitude  In  selecting  functional 
forms  to  represent  auto-correlated  model  errors,  and  these  range  from  the 
use  of  simple  constants  by  Russell  and  Curkendall  (36)  to  higher  order 
differential  equations  (37).  An  obvious  problem  is  that  of  selecting  the 
correct  functional  form  or  structure.  This  problem  has  been  touched  on 
briefly  by  Lainiotis  (38),  and  will  be  considered  further  in  Chapter  3. 

1.6  Outline  of  the  Investigation 

As  stated  In  Section  1.1,  this  study  Investigates  the  utility  of  a 
variety  of  model  error  compensation  techniques  and  provides  a  comparison  of 
the  effectiveness  of  these  methods.  The  particular  techniques  Investigated 
are  selected  from  those  mentioned  in  the  previous  section,  along  with  several 
extensions  and  new  approaches.  The  study  is  limited  to  implementation  using 
the  Kalman  sequential  estimators  (Algorithms  I  and  II)  presented  in  section 
1.2,  and  the  error  compensation  methods  studied  are  general  enough  to  be 
applicable  to  time  varying  as  well  as  time  invariant  systems. 

In  Chapter  2  the  suboptimal  non-adaptive  methods  of  the  Schmidt 
modifications  and  of  age-weighting  devices  are  presented.  Also  presented 
in  this  chapter  is  Jazwinski's  limited  memory  filter  algorithm. 

Chapter  3  is  concerned  with  adaptive  methods,  both  optimal  and  sub- 
optimal.  Jazwinski's  adaptive  technique  for  estimating  the  state  noise 
covariance  is  presented.  Several  extensions  are  given,  and  the  covariance 
matching  technique  applied  to  yield  adaptive  forms  for  the  Schmidt  and  age¬ 
weighting  algorithms  presented  in  Chapter  2.  The  briefly  mentioned  problem 
of  structurally  adaptive  filtering  is  considered,  and  results  are  obtained 
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for  certain  restricted  forms.  A  new  method  is  developed  for  estimation  the 
state  noise  covariance  matrix.  The  resulting  algorithm  employs  the  linear 
minimum  variance  sequential  estimator  to  adaptively  estimate  the  state  noise 
covariance  elements  for  the  case  of  linear  observations.  The  same  approach 
is  shown  to  be  capable  of  modifying  the  error  covariance  elements  directly. 

Chapters  4  and  5  apply  the  previously  presented  algorithms  to  two 
dynamic  systems.  The  scalar  rendezvous  problem  presented  in  section  1.4  is 
used  to  provide  tractable,  closed-form  solutions  and  to  offer  insight  into 
the  nature  of  many  of  the  methods.  In  Chapter  5  a  second  problem  of  a  more 
complex  but  realistic  nature  is  investigated  using  the  most  promising 
approaches  indicated  In  Chapter  4.  This  concerns  estimating  the  heliocentric 
trajectory  of  a  continuous  low  thrust,  solar-electric  spacecraft,  subject  to 
errors  in  the  thrust  acceleration  vector.  A  complete  description  of  this 
problem  is  given  In  Chapter  5. 

Chapter  6  Is  a  concise  summary  of  the  investigation  and  presents  the 
general  conclusions  and  recommendations  for  further  study. 


Chapter  2 


NON- ADAPTIVE  METHODS 


2.1  Age-Weighting  of  Data 

The  rationale  for  age-weighting  the  observation  data  to  compensate 
for  model  errors  Is  this:  The  Kalman  filter  uses  all  the  data  as  information 
for  obtaining  an  estimate.  However,  the  dynamic  model  assumed  in  the  process 
is  in  error  and  therefore  yields  a  reasonable  approximation  to  the  true 
motion  over  only  a  finite  interval  of  time.  Therefore,  attempting  to  make 
the  data  consistent  with  an  erroneous  model  over  the  total  estimation  period  is 
unreasonable  over  a  long  period.  A  logical  alternative  Is  to  downqrade  the  impor 
tance  of  the  older  data.  Fagln  (17)  does  precisely  this  by  minimizing  ,*  In 
equation  (l.fi.i)  where  /?.  =  erp[(tk  -  t  )/  t]  R. ,  with  t  arbitrarily  chosen. 

The  results  are  equations  (i\  I.R)  with  «  =  -  t  .)/\\. 

K  1 

For  the  derivation  here,  a  slightly  different  and  more  direct  approach 
due  to  Tarn  and  Zaborszky  (16)  is  given.  Here,  a  general  function,  <?,  is  used 
as  the  weiqhtinq  factor  instead  of  the  special  case  of  an  exponential.  Pro¬ 
ceeding  heuristically,  it  is  noted  that  aging  the  old  data  (decreasing  its 
importance)  will  yield  the  same  results  as  increasino  the  importance  of  the 
more  current  data.  This  may  be  accomplished  simply  by  scaling  the  error 
covariance  by  s,  with  a  >  l.  Thus  the  error  covariance  and  hence  the  gain 
is  increased  so  that  more  importance  is  attached  to  the  more  recent  observa¬ 
tions.  The  subo  timal  covariance,  denoted  as  f? ,  is  aV  with  the  kalman 
equations  modified  by  simply  substituting  for  r.  Thus, 
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St  =  x.  +  K.  [u.  -  H  x.  ] 

k  k  klJk  k 

Pks  =  8(i  -  kh)  Pk 
where  n  =  exp  [(t  -  t  .  )/x], 

AC  Ac- i 

The  effects  on  the  age-weighting  algorithm  for  various  values  of 
are  now  investiqated.  First,  if  8=1 ,  equations  and  i':'.  are 

just  the  Kalman  equations  of  Algorithm  I.  If  a  is  very  large,  the  gain 
approaches  a  value  such  that  HK  =  l ,  in  which  case  only  the  most  recent 


observation  Is  considered  In  forming  the  estimate.  This  latter  fact  is  seen 
by  premultiplying  (1.2.8)  by  H. 


To  gain  further  insight  Into  the  algorithm,  (2.1.2)  is  used  to  form 
the  variance  between  a  perfect  measurement  and  a  computed  measurement  (for 
a  single  observation),  HFSHT.  For  a  perfect  observation,  y  =  Hx\  for  the 
computed  observation,  y  =  HSt.  Differencing  these  and  taking  the  expected 
value  gives 

y  =  y  ~  y  =  H(x  -St)  =  II  x 
K[y  f  1  =  H  T?  ht 

which  represents  a  measure  of  the  estimation  performance  for  comparison  with 
the  Kalman  filter.  For  the  case  of  scalar  observations  (to  which  the  vector 
case  can  always  be  reduced),  (2.1.2)  is  used  to  obtain 

//  ps  ht  =  s  \h  p  ht - ny.lLJ.1 —  1  , 

L  <7/  r  ir  +  E/a)  J 

Introducinq  the  definition 

<•  •-  h/h  p  nT  >  o,  (2.1. ■:) 

equation  (2.1.3)  becomes 


For  the  optimal  Kalman  filter,  s  =  l,  and 

II  P  llT  = 

l  +  o 

Forming  the  ration  of  (2.1.5)  and  (2.1.6), 


sR 


1  +  0  s  +  sc 


8+0 


R 


8  +  C  — 


>  1,  8  >  1 


(  V  1  7 1 

\  •  »  •  *  / 


Thus,  the  measure  of  performance,  given  by  HT^Ht  for  age-weighting,  is 
bounded  below  by  that  for  the  optimal  filter.  To  obtain  an  upper  bound, 
consider  (2.1.5)  in  the  limit  as  a  +  ».  Employ  [.'Hospital's  rule  to  find 


8R 

8  f  (J 
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* 


Urn  H  F*  HT  «  lim 


R 


(2.  1.6) 


g-*to 


Q-HO 


The  values  of  HPkHT  are  then  bounded  according  to 

0  <  PirS  UT  <  R,  8  >  1  i 2.1.0} 

for  k  >  l  (after  the  first  observation). 

The  variance  of  perfect  measurements,  hpht ,  forms  a  useful  common 
denominator  for  comparison  among  various  algorithms.  Letting  this  quantity 
be  denoted  by  c,  the  corresponding  relationship  with  n  inav  be  found.  Again 
considering  the  case  of  scalar  observations,  equation  is  manipulated 

to  give  the  following  sequence: 

h  p  ht  =  C  =  <?[//  T  ht  -up  ht(h  p  ht  +  -  f1  h  p  ht] 

(1 


c(H  P  flT)~1  =  a[l  -  H  P  UT(U  P  HT  +  -  J'J] 

8 

e  +  c(H  P  H T)~1  -  =  b(H  P  llT  +  -  )  -  8  HP  HT 

3  S 


e  c  +  c(H  P  H7)-1  R  =  P  s 


or 

8 '777  \rJ77s] 

This  result  is  consistent  with  (2.1.9)  as  can  be  seen  by  solving  (2.1.10) 
for  e  and  noting  the  results  as  s  -*•  l  and  s  ■*  «.  Note  that  as  e  ♦  r,  *  -*• 
and  the  estimate  depends  more  and  more  upon  the  latest  observations. 

Implementation  of  the  standard  age-weighting  algorithm  is  straight¬ 
forward  (Figure  2.1),  and  may  be  done  using  either  equations  (2 .1.1)  or 
(2.1.2)  . 
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2.2  Schmidt  Suboptlmal  Filter;  Scaling  the  Gain 

Schmidt  (19)  considers  modifying  the  gain  term  directly  using  a 
scaling  constant,  b.  The  state  estimate  Is  then  given  by 

Xk=Xk+b  KkSyk  "  H  xk^  (2.2.1) 

and  the  corresponding  error  covariance  can  be  found  from 

**  "  x*  -  *k  "  **  ’  b  Wk  ’  H  xk]  =  xk~h  Kk[H  xk  +  vk]  (s-s'r) 


l‘k  =  V  -  b  Kk  H 1  l'k[l  -  /•  Kk  H]T  +  b2  K  R  KkT 


«  P  -  b  K  H  P  -  b  P  HT  KT  +  b2  K(H  P  llT  +  R)  KT 


or 


Pk  =  {I  -  (2b  -  b2)  Pk  HT[H  Pk  HT  +  R]'-1  H }  PR 

°roceeding  In  a  manner  similar  to  that  in  section  2.1,  Schmidt  forms 
//  p  iit  =  //  p  nT  .  (?j,  -  b9)  ii  r  ht(h  p  iit  +  r)~1  h  r  ht 
Considering  scalar  measurements  as  before,  Schmidt  specifies  ///^ //r  at  some 
acceptable  value,  e,  and  subsequently  solves  for  />,  yielding 

b-ity 


,-l 


(c.r.c) 


i  r.  ;  ’ 


e(H  r  ht  +  R)  -  r  n  p  ht 


(H  P  ilT)  2 


( cs  %>  :  ) 


for  which  real  solutions  exist  when 


>  h  r  . 

[//  P  HT  +  R  J 


R  H 


and  the  positive  sign  in  (2.2.5)  is  taken  to  insure  b  _> 

The  effects  of  various  values  of  b  may  be  seen  readily  by  examining 


equation  (2.2.3). 
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b  =  1  Standard  (optimum)  Kalman  gain  and  error  covariance  update 

equation 

1  <  b  <  2  A  smaller  than  optimum  term  Is  subtracted  from  Pk  in  forming 
the  updated  covariance,  Pk 


b  ■ 2  - p* 

b  >  2  A  positive  term  Is  added  to  Pk 

Extending  Schmidt's  analysis,  corresponding  values  of  c  are  easily  found  to 


for  b  -  1 


<  e  <  H  P  HT  for  1  <  b  <  2 

e  =  HP  HT  for  b  =  2 

e  >  HP  HT  for  b  >  2 

Further,  values  for  b  and  c  may  be  found  which  yield  HK  -  i,  thereby 
forcing  the  estimate  to  depend  only  upon  the  more  recent  observations.  In 
this  case, 

b  H  K,  «  b  HP  HT(H  P  HT  +  R)'1  <  1 

k  — 

or 

hpht  +  r 

b  <  - - -  ('2.  r.  ~ 

—  —  T 

H  P .  H 
k 

Using  (2.2.5)  and  (2.2.7) 3 


f  H  P  HT 
\hpht  +  r 


H  P  HT  +  R  >  z(H  P  HT  +  R)  -  R  H  P  H1 
H  P  HT  ~  (H  P  KT) 2 


one  may  solve  for  e  to  obtain 


ST 


In  view  of  C'.r.n),  < u)  requires  that 


R 


—  T 
H  P.  II 
k 


R  +  H  P.  RT 
k 


<  e  <  R 


(2.2.9) 


or 


R  H  K  <  t  <  R 


(2.2.10) 


For  estimates  based  upon  only  the  current  observation,  equality  must  hold  in 
(2.2.8).  It  may  be  recalled  that  this  agrees  with  result  obtained  for  the 
age-weighting  algorithm. 

Implementation  of  this  algorithm  is  straightforward.  A  slightly 
different  approach  involves  using  the  following  modified  form  of  f.*?. :) : 


b  =  a 


HP,  Hr  +  R 
k 


H  P.  HT 
k 


(*>  "  ;  / ) 


where  HK  <_  a  <_  1.  The  lower  limit  for  a  follows  from  the  requirement  that 
b  >  l.  When  applied  to  the  Kalman  gain,  the  following  result  is  obtained: 

r  =  b  k  =  a  I  ------- 1  — - =  --  » a  rr.  r.  i::) 


"h  p  ht  +  /?! 

a  — — — 

L  H  P  H1  J 


(H  P  uT  +  R)  HP  ht 


and  the  covariance  update  equation  (::.  becomes 

P  =  P  -  [2a  -  (1  +  )  u2J  r  -l--z 

H  p  a  ii  r  ir 


/  o  •  •  \ 

l  .  ■  •  .  •  ‘ » 


Suppose  a  =  c/l i.  From  the  limits  on  a,  the  corresponding  limits  on  t  are 
seen  to  be  those  in  (2.2.9).  Thus,  e  =  /  results  in  estimates  based  upon 
only  the  current  observation. 

The  gain  scaling  technique  using  a  constant  value  of  !  can  lead  to 
some  problems,  and  this  will  be  shown  specifically  in  Chapter  4.  Another 
alternative  method  is  to  determine  b  as 

b  *  l  +  6  — i-  ■  ,  o  <  6  <.  i  i  r. 

H  P  H 
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The  limits  ori  3  correspond  to  the  expected  limits  for  /•  discussed  earlier. 

In  particular,  3  *  0  gives  b  =  /  resulting  In  the  optimal  Kalman  filter; 

3  =  2  gives  the  value  of  b  Indicated  by  equality  holding  in  (2.2.7),  i.e., 
all  Information  Is  obtained  from  the  most  recent  observation.  Employing 
( 2.2.14 )  In  the  gain  and  the  error  covariance  update  equation  (2.2.2)  gives 
the  following: 

K'  =  b  K 

=  [i  +  -_R  ]  p  nToi  p  ht  +  r)~1 

II  P  H 

=  [p  ht  +  3  R  p  ht/ii  p  nT]  (n  p  uT  +  a)-1  (::.r.  /.-> 


2b  -  b 2 


ii  r  ht 


-  i  - 


h  p  ht 


(H  P  //V 


/i" 

(H  r  iir)  - 


(r..r..JtV 


2  2 

P  =  {I  -  12 - ]  P  P  UT  +  R)"1  //}  P  (2.2.  D 

(R  P  HT)  2 

Forming  HPllT  yields 

h  p  ht  »  (H  +  g2  R'-  r  (:\::. 

h  p  ir  +  r 

It  has  been  Indicated  by  Schmidt  (20)  that  this  method  of  determining 
yields  estimation  results  identical  with  his  additive  gain  teriv  algorithm 
discussed  In  the  next  section. 

The  gain  scaling  technique  for  constant  b  or  (2.2. 14)  to  determine  : 
Is  diagrammed  in  Figure  2.2.  Figure  2.3  gives  the  logic  flow  for  the  modi¬ 
fied  gain  scaling  algorithm  using  equations  (2.2.12)  -  (2.2.12).  For  this 
modified  approach,  It  Is  possible  that  a  value  of  e  might  be  chosen  which  is 
Initially  less  than  the  lower  bound  given  by  (2.2.9) .  This  is  not  generally 
desirable.  The  philosophy  here  Is  to  operate  with  the  ordinary  Kalman  filter 
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Figure  2.2  Schmidt  Suboptlmal  Filter:  Gain  Scaling 


Given: 


^ 


o ,  k 


max 
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equations,  and  when  HPHT  =  e,  switch  to  the  suboptlmal  modified  gain  scalinq 
equations. 


2.3  Schmidt  Suboptlmal  Filter:  Additive  Gain  Term 

A  second  approach  taken  by  Schmidt  (19)  Is  to  add  a  judiciously  chosen 
term,  L,  to  the  optimal  gain,  K.  This  approach  is  possible  only  for  scalar 
observations.  The  suboptlmal  term  is  given  as 


2^=0 - JT 

A  /ri  rr* 


R  R 


(H  H‘)  [H  Pk  HT  +  R] 


so  that  the  actual  suboptimal  gain  is  found  as 


Mk‘Kk  +  *Lk 

=  P  HT[H  P  HT  +  Rf1  +  3 

A  A 


R  // 


(H  HT)  [H  Pk  HT  +  R] 


If  3  is  chosen  as 

1 11  ~  "T 

3  =  3" 


(2.  3.  I ' 


(2.3.2) 


then 


3  r.  =  3 


-  Hi 

1 

H  // 


If  one  considers  the  estimate  of  x  with  no  a  pni on <  information,  one  obtains 
Bk  ‘  *k f  !»k  - " 

where  //  is  the  pseudo  inverse  (45,  82-89)  of//. 

,/  -  jL 


II  IV 


Then  (2.3.2)  is  simply  a  linear  combination  of  two  gains:  the  optimal  gain 
for  all  the  observations,  and  the  gain  for  no  a  p/uofii  information,  given  by 

U 

the  pseudo  inverse,  H. 
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The  updated  covariance  equation  Is  also  modified.  Uslnq  to 

form  the  state  estimate, 

xk  =  xk  +  Mk\yk  -  H  xk)  (2.3.3) 

and  subsequently  the  estimate  error, 

*k  ■  xk  -  Mk[H  xk  +  Vk]  (2.3.4) 

the  error  covariance  Is  determined  as 

E{xk  xk  }  -  P  -  (I  -  M  U)  P  (I  -  M  H)T 

«  (I  -  M  H)  P  -  P  HT  bF  +  M(H 

But  since 

r 

M(H  P  HT  +  R)  -  (P  HT  +  0  )t  Pu  = 

H  HT  k 

Thus  the  updated  covariance  matrix  Is  of  the 
with  the  exception  that  the  suboptlmal  gain, 
term  Is  added.  An  alternative  form  of  (2.3.5)  may  be  found  as 

Pk  -  (I  -  K  H)  Pk  +  B2  R  L  H/(H  HT)  (2.3.6) 

This  Is  the  form  derived  by  Schmidt. 

Reasonable  limits  for  6  are  readily  established  by  considering  hm, 
formed  using  (2.3.2). 

H  M  *  HP  HT(H  P  HT  +  R)'1  +  Q  R(H  P  HT  +  R)'1 

■  (H  P  HT  +  0  R)  (H  P  HT  f  R r1  (2.  3.  7) 

As  before,  for  hm  -  1,  the  most  recent  observation  drives  the  estimate. 

Hence  using  (2.3.7) 

H  M  <  1  ->  8  £  1 

The  lower  limit  on  8  Is  established  heurlstlcally  by  considering  the  fact 
that  8  <  0  decreases  the  optimal  gain,  an  effect  which  counters  the  Idea  of 


+  M  R  tf 
P  HT  +  R)  tF 

T 

(I  -  M  H)  P  +  8  — |  W3" 

same  form  as  the  optimal  equation 
Mt  Is  used,  and  an  additional 
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depending  more  on  the  recent  observations.  Thus  limits  on  0  are 

0  ^  0  £  1  1  ti) 

For  0  =  0,  M  *  K\  for  3  *  1  only  the  current  observation  is  used  to  determine 
the  estimate. 

It  is  interesting  to  extend  Schmidt's  analysis,  and  find  the  relation¬ 
ship  between  b  and  0.  Forming  hpht  from  equation  (2.3.6), 

H  P  HT  =  HP  UT  -  (HP  IIT)  2  <7/  P  UT  +  R)'1  +  02  R2(H  P  HT  +  R)'1 

Denoting  c  =  HPHT ,  as  in  section  2.2,  an  expression  for  0  may  be  found. 

c(n  r  ht  +  r)  =  (hp  iiT)  (h  p  ht  +  r)  -  (a  r  ht)  2  +  p2  r2 


or 


B  =  i 


1 


c(H  P  IIT  V-  R)  -  H  P  HT  R 


1 . . . 


>) 1 


R s 


where  the  positive  sign  is  taken  in  light  of  (P..S.8).  Now  comparing  this 
with  the  expression  for  b  in  section  2.2,  b  may  be  rewritten  as 


b  «  1  +  0 


R 


—  T 

H  PH1 


( Cl  X  7  1  1 
I  .  i .  0 .  1 1.  , 


Equation  (2.3.10)  is  identical  with  (2.2.14),  and  therefore  provides  the 
link  between  the  gain  scaling  and  additive  gain  term  algorithms.  It  should 
be  noted  that  this  relationship  is  derived  by  forming  HPHt ,  and  then  developing 
the  equation  based  on  this.  By  examining  equations  (2.2.15),  (2.3.2), 

(2.2.17),  and  (2.3.6)  one  will  see  that  these  gains  and  covariances  differ 
by  the  presence  or  absence  of  the  error  covariance  matrix  in  the  suboptlmal 
term.  The  case  where  it  is  present,  i.e.,  the  suboptimal  gain  term  is  given 
as 

.  0  R  P  HT _ 

(H  P  HT)  (HPHT  +  R) 


Is  In  fact  more  general.  This  Is  true  because  when  the  suboptimal  gain  term 
Is 

0  R  HT _ 

(H  HT)  (H  P  HT  +  R) 

any  zero  terms  in  H  prevent  contribution  to  the  corresponding  optimal  Kalman 
gain  term.  Usually  this  does  not  occur  for  the  former  case. 

Note  that  in  deriving  (2.3.10) ,  when  hpht  is  formed  for  each  instance, 
the  differences  in  the  suboptimal  covariance  terms  disappear.  Hence  these 
differences  do  not  occur  in  (2.3.10). 

Conditions  for  $  may  be  established  similar  to  those  for  b  in  section 
2.2.  In  particular, 

0*0  Standard  (optimum)  Kalman  gain  and  error  covariance  update 

equation 


0  <  0  < 


H  P  HT 
R 


0  - 


0  < 


H  P  HT 
R 

hpht 

R 


A  smaller  than  optimum  term  is  subtracted  from  in  forming 
the  updated  covariance,  Pk 


A  positive  term  is  added  to  ?k 


Employing  (2.3.8)  and  (2.3.9) ,  limits  may  be  found  for  e. 


* 


3  =  P  HT  +  R)  -  H  P  Ht  r  <  2 

Rz 


or,  using  the  Inequality, 

e  <  R  (2.3.11) 

which  Is  just  the  condition  established  in  section  2.2. 

As  In  the  previous  section,  real  solutions  require  the  radicand  to 


be  positive;  l.e., 
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—  7* 

/»  //  r  n 

\  •  — ~ — - - 

(//  /■  //r  f  to 

which  is  just  the  inequality  (z.x.O. 

The  implementation  of  this  algorithm  is  given  in  Figure  2.4. 


2.4  Limited  Memory  Filter 

The  philosophy  behind  the  limited  memory  filter  is  similar  to  that 
of  the  suboptimal  techniques  presented  in  the  previous  sections.  However, 
rather  than  weight  the  data  In  some  arbitrary  sense,  limited  memory  filters 
"discard"  old  observations. 

Jazwinski  (21)  develops  a  limited  memory  which  is  suboptimal  only  in 
the  sense  that  it  does  not  take  into  account  the  information  from  all  observa¬ 
tions  in  forming  the  state  estimate.  However,  the  filter  equations 
developed  are,  in  fact,  optimal  over  the  set  of  observations  considered. 

In  other  words,  for  some  subset  of  the  total  observations,  the  limited  memory 
filter  provides  a  linear,  minimum  variance,  unbiased  estimate  of  the  state. 

Jazwinski 's  derivation  applies  to  the  general,  nonlinear  case.  However, 
it  requires  assumption  of  the  probability  density  function.  He  also  shows 
that  the  linear  limited  memory  filter  may  be  derived  from  least  squares  con¬ 
siderations.  Since  concern  here  is  with  the  linear  problem,  this  approach 
is  sufficient  for  our  purposes.  The  derivation  is  duplicated  here. 

Consider  the  estimate  of  the  state  at  time  t,  based  on  observations 

k 

throuqh  time  t  .  As  defined  in  Chapter  1,  this  is  denoted  as  i, i  .  The 

m  r  k  |m 

least  squares  estimate  of  x  at  t  based  on  observations  throuqh  time  r  is 

A  \ 

derived  (5,  206)  as  follows:  Form  the  performance  index,  ^  ,  as 

>1  -  fg(x  -  x  |  )T  l~ \  ( x  -  x  |  )  + 

k  m  mm  mmm  mm 


i  £  (y  •  ~  H  •  x  A’ .  (y  .  -  // .  .r , . 
i*ml  1  11  11  11 
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Recall  Inn  that  x,  =  x  , 

k  k,m  m 


•/.  =  H(x.  -  B.  |  )T  4?  P,  i.  + 

k  k  /c|m  k,m  /n|m  k,m 


*  (yi  "  ®i,*  V  Ri  (iji 


H.  *.  .  X.) 
i  i,k  k 


(2.4.2) 


Further,  define 


y  » 


ym+l 

■»  M 

H  Q 

m+1  m+1  ,k 

y  m+2 

H  <J> 

m+2  m+2 ,k 

• 

,  H  = 

• 

• 

• 

A  . 

and 


p'1 


/?_i  • 
rofl ' 


IP?"-1 


m+2 


Then, 


i - 1 

i  _  i  i 
Id  1  I 
I^L.  I 
l_*  J 


/  n  f  r  | 
(  U  .  'i  •  0 


4  -  -  \|/  p;j„^  -  a*|";+  -  «  VT  *  'J'."  - »  V 


4. 


Next  take  the  gradient  of  <7^  with  respect  to  x^,  and  set  it  equal  to  zero. 


<5J.  =  P~J  (x.  -  )  -  HT  R  ~2(y  -  H  xj  =  0 

k  /c|m  k  Zc|m  3  k 

Solving  for  xR  =  we  obtain, 

x,  | .  =  [P'-f  +HTR~2  H]~2  [HT  R'1  y  +  P,  TJ  x,  |  ] 
k  |m  J  1  3  /c|m  *|mJ 


i  °  J  ^  1 
(  a  •  'i  •  0 


or 


(2.4.6) 


66 


The  fact  that  /'.  i,  =  [r~\  +  Ji1  h  ~l  ti]~l  follows  from  equation  n.z.'jj. 

k\k  k\m 

Notice  that  if  there  is  no  a  pfUofu.  information  about  x,  then  =  o 

1  k  k  |m 

and  (2.4.6)  reduces  to  the  standard  weighted  least  squares  algorithm.  In 
this  case,  letting  N  =  k  -  m,  equation  (2.4.6)  is  used  to  obtain  * 

A.  .-T  —  -i  -.-2  r.T  —  -1  — 

xk\(N)  *  W  H  H 1  H  R  y 

or 


Xk\(N)  =  Pk\(N)  R  R  y 

Now  combining  equations 
sequence: 

Xk\k  =  Pk\k^Pk\(N)  Xk\(N) 
Pk\k  Xk\k  =  Pk\(N)  Xk\(N) 
Xk  |  (N)  =  Pk  |  (N)  Vk\k  Xk\k 


i  :\  i.  /’ 

(2.4.6)  and  (2.4.7)  one  obtains  the  following 


+  P*jm  £k\m 

■  **|J 


(r.4.v 


wi  th 

Pk  |  (N) 


k  |mJ 


.?  .1 1 

/ . . 


The  limited  memory  filter  equations  described  above  are  obviously 
more  complex  to  implement  than  the  previously  presented  filtering  algorithms. 
Essentially  two  Kalman  sequences  are  required  to  be  run  for  each  "batch"  of 
N  observations,  and  three  matrix  inverses  are  required  every  .v  observations. 
Further,  the  limited  memory  filter  estimate  is  obtained  only  every  .v 


*  x,  i  is  the  state  estimate  at  t,  based  on  observations  to  t,  ,  less 
k  |  (N)  k  k 

those  to  t  ,  l.e.,  on  the  batch  of  the  last  N  =  k  -  m  observations.  The 
m 

same  meaning  applies  to  the  error  covariance, 


67 


observations.  This  number  may  be  Increased,  but  only  at  the  expense  of  a 
larger  number  of  Inverses  of  pk\k>  pk\j„>  and  pk\ (Ny 

It  Is  possible  to  modify  Jazwinskl's  algorithm  In  order  to  eliminate 


one  of  the  matrix  Inversions.  By  factoring  out  pk\k*  equation  (2.4.9)  may 
be  written  as 


*  v  -  p*i*  p*1„rJ 


(2.4.10) 


Also,  solving  (2.4.9)  for  and  substituting  into  (2.4.8)  yields 

Xk  |  (N)  *  Xk\k  +  Pk  |  (N)  Pk  |m  ^k\k  "  £k  |m  ^ 


i ; 


Thus  the  inversion  of  p.  ,  has  been  eliminated,  and  (2.4.  to)  and  (2.4.  IP  are 

k  |A 

the  equations  to  be  employed.  It  is,  of  course,  necessary  to  obtain  £ .  and 

K  I  K 

hence  pk\k  is  required. 

The  limited  memory  filter  is  Implemented  as  a  sequential  estimator. 


Figure  2.5  diagrams  the  modified  procedure  which  uses  (2.4.10)  and  (2.4.1V. 


The  Kalman  filter  equations  are  run  from  k  =  0  to  k  -  m  -  processing  obser¬ 
vations  and  obtaining  estimates  in  the  usual  manner.  This  represents  an 
initialization  phase.  The  values  of  p  .  and  £  ■  are  stored.  The  Kalman 

/n  |  m  m|m 

equations  are  then  run  from  k  *  m  to  k  -  m  +  N,  a’so  in  the  usual  manner,  and 


the  state  transition  matrix  <t>  is  obtained  using  <fr  „  ,  =  a  ,  a  , 

m+N  ,m  3  k+2,k  k+2,k+l  k+l,k 

The  predicted  values,  P  .  and  £  ,,  are  found  using  <j>  and  equations 

r  m+W  |m  3  m+N,m  3 

(2.4.10)  and  (2.4.11)  applied  to  obtain  5  and  p  ./M1.  The  process 

is  then  repeated  (except  for  the  initialization  phase)  using  the  limited 
memory  filter  outputs  as  the  inputs  for  the  next  cycle  (from  -  =  2K  to  ni  =  su) . 


It  appears  from  examination  of  the  equations  for  finding  *  , 

K.  (  .. 

(2.4.9)  or  (2.4.11) ,  that  numerical  problems  could  be  encounte  p<l,  pa  ticularly 
where  short  word  length  computers  must  be  used.  Equations  (2.4.9)  and 
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Figure  2.5  Modified  Limited  Memory  Filter 
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(Z.'t.JO)  require  computation  of  the  Inverse  of  the  difference  of  the  Inverses 


of  two  matrices  of  similar  values. 

It  Is  possible  to  offset  this  state  of  affairs.  An  alternative  due 
to  Jazwlnski  (5,  258),  reduces  the  number  of  P-matrlx  inversions  to  one  by 
working  directly  with  the  Inverses  only.  Equation  (1.3.9)  from  section  1.3 
provides  the  necessary  recursion  relation.  Dropping  the  m-subscrlpts  from 
(1.3.9)  one  obtains  at  time  t ,  based  on  measurements  to  t  , 

(c.j. ir ' 


T 


.-1 


.-1 


P.i.  ,  =  <j>  \  .  P“*  I.  ,  *  *  ,  +  hJ  rJ1  h . 
k\k~l  k,k-l  k-l\k-l  k,k-l  k  k  k 

For  prediction  only,  the  information  term,  HT  R~]  H ,  for  time  is  ignored. 


Thus  at  t. ,  based  on  measurements  to  t  , 

K  ffl 

-?  -1  .-T  n-i  .-1 

Pi  =  <P  Pi 
k  |m  k,m  m\m  k,m 


(?..4. 13) 


This,  of  course,  requires  taking  the  inverse  of  <l>.  If  4>  Is  symplectic*,  its 
Inverse  Is  readily  obtained.  Alternatively ,  Instead  of  Integrating  {»h. 
the  Inverse  may  be  found  by  direct  Integration.  is  found  as  follows: 

<r 1  $>  =  i 

•—7  —7  • 

$  <j>  +  $  $  =  o 

1-1  tT1  A 

4>  =  -4>  A 


It  should  be  apparent  that  all  the  previously  presented  algorithms 
may  be  Implemented  such  that  x  and  P  are  obtained  by  Integrating  their 


*  An  even-dimens loned  matrix,  $,  is  said  to  be  symplectic  if 

<J>T  J  4>  =  J,  where  J  =  |  0  1 

L-i  o 

Post-multiplying  by  4>  1  and  pre-multiplying  by  J  gives 

-1  T 

*  -  -j  r  j 
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appropriate  differential  equations.  Similarly,  differential  equations  may 
be  obtained  for  the  prediction  and  update  of  P-i.  First, 

P~1  P  =  I 

PJ‘P  +  P  P  =  0 

•  —  ?  -1  •  —I 

p  =  -p  1  p  p  1 

and  using  equation  ( 1.2.39 )  with  Q  =  0,  one  obtains  for  the  prediction, 

P~1(t)  =  -p~1(t)  Aft)  -  AT(t)  P~1(t)  (2.4.14) 

The  differential  equation  for  updating  p-i  may  be  found  by  applying  a  limiting 
process  to  (2.4.12)  similar  to  that  used  in  obtaining  equation  (1.2.  39). 

First, 

P~1(i)  =  t)  P~1(t)  f1(-xs  t)  +  HT(x)  r~1( i)  H(i) 

=  [$-T(tt  t)  +  l~T(t,  t)  At  +  ...]  P~1ft) 

•  [<J >"Jrt,  t)  +  $~T(t,  t)  At  +  ...}  +  HT(x)  li~1( -X)  Hfx) 

Carrying  out  the  indicated  multiplication, 

P~1(l)  =  P~1(t)  -  AT(t)  P~1(t)  At  -  P~1(t)  Aft)  +  ...  +  llT( t)  R~1(~l)  H( T» 

Performing  the  limiting  process  requires  subtracting  P~1(t)  from  both  sides, 
and  dividing  by  At.  However,  when  taking  the  limit  as  A t  ■*  0  the  information 
term  becomes  Infinite  implying  that  the  observations  are  perfect  (no  observa¬ 
tion  noise).  This  Is  Inconsistent  with  the  postulated  observation  process, 
hence  we  replace*  R( x)  with  Rfi)/ At.  With  this  change,  the  limiting  process 
is  applied  to  obtain 

P~1(t)  =  -AT(t)  P~1(t)  -  P-1  Aft)  +  HT(t)  R~1(t)  Hft)  (•:.■:.  If) 

*  Further  rationale  and  a  discussion  of  a  white  noise  process  as  the 
limit  of  a  white  noise  sequence  is  given  in  (5,  83-84). 
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with 

P~1(0)  =  p'1 

When  the  Information  term,  Hr  e'1  h,  is  added  at  discrete  times,  the  approach 
used  is  to  predict  to  t  =  t  using  (2.4.14)  and  then  add  the  information 
terms  at  tk%  i .e. , 


Reverting  to  the  notation  associated  with  the  filter,  there  results 
tk 

P~\  =  -  f  [P~1(t\t  )  Aft)  +  AT(t)  P~1(t\t  )]  dt  (2.4.17) 

k  |m  1  m  1  m 

m 

pl\M  - 1  l-/*1  lP'lulW  Mt>  +  AT(t>  w  * 

v  t  j 

+  HT (j+1)  R~1(c+1)  H(j+1) 

with 

P_I(t  I 0)  =  0  (2.4.18) 

m' 

As  before.  It  is  not  necessary  to  compute  (although  P^  is  required) 
since  equation  (2.4.11)  provides  the  limited  memory  estimate.  Thus  only  one 
P-matrix  inversion  is  required:  that  of  P~ to  obtain  Pk\(Ny  However, 
the  inverse  of  the  state  transition  matrix  is  required,  or  else  (2.1.1?)  and 
(2.4.18)  must  be  used.  If  «J>-J  cannot  be  easily  obtained,  the  advantage  of 
this  approach  is  somewhat  decreased.  The  Implementation  of  the  alternate 
limited  memory  filter  is  diagramed  in  Figure  2.6. 
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w 


N(|uru  2.6  Alternate  limited  Memory  I  liter 
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2.5  Summary 

In  this  chapter  the  non-adaptive  error  compensation  algorithms  of 
age-weighting,  gain-scaling,  additive  gain  term,  and  limited  memory  filter¬ 
ing  have  been  presented.  Some  of  the  original  algorithms  have  been  extended 
to  provide  alternate  forms,  and  relationships  derived  showing  certain 
equivalences  among  the  first  three. 

Equations  (2.1.1)  give  the  basic  age-weighting  algorithm  as  conceived 
by  Tarn  and  Zaborsky,  with  Fagin's  form  given  by  (2.  l.  l) . 

Schmidt's  gain-scaling  algorithm  is  given  by  equations  and 

(2.2.3).  The  relationship  between  the  scaling  factor,  b ,  and  e  (=im-:T)  is 
given  by  (2.2.5).  Modifications  of  the  algorithm  are  given  by  the  alternate 
equations  for  determining  b ,  equations  (2.2.11)  and  (2.2.14). 

The  second  filter  design  of  Schmidt's  adds  a  judiciously  selected 
term  to  the  optimal  gain,  K.  The  resulting  gain  and  covariance  are  given  by 
equations  (2.3.2)  and  (2.3.5)  or  (2.3.6).  The  weighting  factor,  3,  associated 
with  the  additional  gain  term  is  shown  to  be  related  to  e  by  equation  (2.3.2', 
and  related  to  b  by  (2.3.7). 

Jazwinski's  limited  memory  filter  is  given  by  equations  (2.4.3)  and 
(2.4.9)  together  with  the  Kalman  filtering  algorithm.  A  modification  to  the 
basic  limited  memory  filter  eliminates  one  of  the  three  matrix  Inverses,  and 
is  given  by  (2.4.10)  and  (2.4.11).  An  alternate  form  employs  the  inverse  of 
the  error  covariance  directly,  thereby  reducing  the  number  of  matrix  inverses 
to  one,  and  is  particularly  advantageous  if  the  state  transition  matrix  is 
symplectic. 

In  the  next  chapter  we  turn  our  attention  to  adaptive  techniques. 


both  optimal  and  suboptimal . 


Chapter  3 


ADAPTIVE  METHODS 


3.1  Estimating  the  State  Noise  Covariance 

When  modeling  errors  are  assumed  to  be  represented  as  purely  random, 
uncorrelated  noise,  it  is  often  a  problem  to  determine  the  appropriate  value 
of  q%  the  state  noise  covariance.  As  indicated  in  section  1.5,  there  is  a 
preponderance  of  adaptive  techniques  which  may  be  used  to  estimate  the 
appropriate  noise  covariances.  Most  of  these,  particularly  Bayesian  and 
maximum  likelihood  approaches,  have  been  thoroughly  investigated  (25),  (26), 
(27),  (28).  In  this  section  an  adaptive  method  due  to  Jazwinski  is  presented 
which  offers  a  reasonably  simple  and  easily  implemented  scheme  for  adaptively 
estimating  the  state  noise  covariance  matrix.  While  the  algorithm  is  derived 
using  a  maximum  likelihood  approach,  an  interesting  property  is  that  in  a 
simplified  form  it  becomes  just  the  covariance  matching  technique. 

Consider  the  following  predicted  residuals,  called  the  innovations  (23) 


’*  '  ■'  -  "k  xk  -  "k  h  *  'V  -  o-  s<»k »/)  -  '<k  %  <*■*• » 

For  simplicity,  is  assumed  to  be  restricted  to  the  scalar  case.  Forming 

the  covariance  of  r,  one  obtains 

k 


E{^  -  Hk  pk  V  *  Rk 


(6.  l. : 


since  K(x  o J)  =  o.  P  is  given  by  equation  (1.2,17).  It  can  be  shown  (23) 

K  K 

that  the  innovations  given  by  (’6.1.1)  are  uncorrelated,  i.e.. 


P{i‘k  rO  «  0t  ,j  >  k. 
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and  that  r  is  a  Gaussian  white  noise  sequence.  Thus  the  joint  probability 
density  function  of  the  innovations  sequence  rk+1*  *k+2*  **•»  *k+N  1S 

w  — *  W  *  1>  •  p'W  •  •••  '3-3-3> 


where 


p(rk+j}  =  i^iir  [WJt+j  Pk+j  Uk+j  + 

«*>  i-*r — — l! 

'  L  Hk+i  Pk+1  Hk+1  *  Pk+j  ■» 


i  I  .  ■  * 


/c+j  /c+j  k+j  '  "k+j 
Suppose  during  operation  of  the  filter  equations  (say,  Algorithm  I)  that 
t  »  t  and  the  predicted  values  Pk+1  and  *  are  to  be  computed.  For  ,v 
predicted  residuals  the  object  is  to  find  Qk\N  (the  state  noise  covariance  at 
tk  based  on  N  innovations)  such  that  (3.1.3)  is  maximized.  In  other  words, 


Q 


k\N 


is  that  value  of  Q  which  yields  the  most  likely  innovations  sequence 


rk+is  vk+2*  **•  rk+N  ’  Thus  very  1ar9e  values  of  Q  imply  less  likely 
sequences  of  the  Innovations. 

Consider  the  maximization  of  (3.1.3)  for  the  case  of  one  residual. 
First,  the  likelihood  function  is  chosen  as  the  joint  density  (3.1.4) ,  and 
its  logarithm  is  taken  to  facilitate  the  maximization  (since  p  and  its 
logarithm  are  monotonic,  extremizing  In  p  extremizes  p). 


J  =  In  p(rk+1)  =  ~  l n  M 

- ) 

'  Hk+1  Pk+1  Hk+1  *  Rk+1  r 

Taking  the  variation  with  respect  to  Q  gives 

l 


k+1  Pk+1  Hk+1  +  Rk+1} 


6J  =  -  % 


(H  P  HT  +  R) 


+  h 


r2 


[H  P  HT  +  R]2 


H  6  P  HT 


(3. 1.5) 


(3.1.6) 

and  assuming  H6PHT  >  0,  the  necessary  condition  5 J  =  0  yields  the  equation 
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-  //  V  H  -  It  =  0 

The  sufficient  condition  for  a  maximum  is  62j  <  o\  hence. 


(H  P  UT  +  R)  2  [II  P  HT  +  R]  3 


(3.1.7) 


r>2  >  H  [U  P  H  +  Ft] 


which  is  met  if  (3.1.7)  holds.  Next,  using  (l. ?..!?), 

Hr  o  rT  = 

k+1  k+l,k  *k\l  k+1 ,k  k+1 

P2  -  H  4>  P  llT  -  R 

k+1  k+1  k+l,k  k  k+1, k  k+1  k+1 


( 1.0) 


This  equation  is  simply  the  equation  for  the  covariance  matching  approach  to 
estimating  Q  based  on  one  residual,  r  .  In  evaluating  (3.1.0)  the  actual 
residual  given  by  equation  (3.1.1)  is  used.  Further,  for  Q  to  be  non-negative 
definite,  the  right  hand  side  must  be  positive.  Otherwise  Qk^2  is  set  to 
zero.  Unfortunately,  HY  is  usually  not  invertable,  hence,  resort  must  be 
made  to  a  pseudo-inverse  or  some  other  technique.  Defining,  for  convenience, 
AEi/r 

the  pseudo  inverse  of  A  (45,  82-89) 


yields 


"4  -  H  *  P k*T  HT  -  R] 

AAr  K+1  k  AAr 


(3.1.10) 


Equation  (3.1.10)  does  not  provide  a  unique  solution  for  Q.  However,  in  the 
special  case  where  Q  is  a  scalar,  say  q ,  one  does  obtain  a  unique  solution: 


r2  -  H  $  P  V  HT  -  R 


(3.1.11) 


77 


Here  again  we  require  the  right  hand  side  to  be  non-negative.  A  disadvantage 
is  that  one  residual  is  hardly  a  sound  statistical  sample  upon  which  to  base 
an  estimate  of  Q.  A  pat  hoc  modification  found  to  be  effective  is  to  employ 
the  sample  mean  of  the  square  of  N  predicted  residuals, 

\  - 1  4  n  ■  *  * 1  '  l  i: 

0=0 


in  place  of  r  *  .  This  requires  storing  n  observations,  which  would  not  be 
so  bad  in  itself,  but  the  state  estimate  then  either  periodically  lags  the 
observations  by  N  points,  or  else  the  last  N  estimates  must  be  reprocessed. 
Either  way  the  result  is  the  same,  but  the  former  is  computationally  more 
efficient  since  the  state  estimate  is  obtained  only  once  for  each  time  point. 

Another  modification  which  is  computationally  easier  to  employ  and 
which  does  not  severly  degrade  the  accuracy  of  the  algorithm  is  to  use  the 
sample  mean  of  the  history  of  the  last  N  predicted  residuals  squared.  In 
this  case. 


1  N-l 

yN  =  N  ? 

0=0 


m-j 


(3.1.1?) 


In  this  approach  the  filter  equations  are  operated  for  the  first  n  observa¬ 
tions,  storing  the  predicted  residual  at  each  time  point,  t  .  For  each 
m  <  /!/,  the  sample  mean, 


Y 


m 


l 

m 


h  Z, 


(3.1.14) 


is  formed  and  used  in  the  estimation  of  Q.  For  each  m  >  N  a  new  squared 
residual  is  added,  and  the  oldest  one  is  discarded.  In  this  way  a  "moving 
window"  viewinq  the  most  recent  n  squared  residuals  is  maintained.  This 
modified  approach  is  diagrammed  in  Figure  3.1. 
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Note  that  both  of  the  approaches  reduce  to  the  optimal  case  of  one 
predicted  residual  when  N  =  l.  In  this  case  (3.1.12)  and  (3.1.13)  are 
identical . 

Further  modifications  are  possible  to  improve  the  single  residual 
estimator  {5,  314);  however,  these  tend  to  become  rather  complex.  Also, 
when  more  than  one  residual  is  used  in  (3.1.3),  the  resulting  set  of  equations 
to  be  solved  is  nonlinear.  In  fact,  they  are  basically  the  types  of  equations 
found  in  other  adaptive  filtering  methods  (26),  (28). 


3.2  Adaptive  Estimation  of  Suboptimal  Filter  Parameters 

Since  the  suboptimal  methods  of  Chapter  2  effectively  alter  the 
state  error  covariance,  it  is  reasonable  to  assume  that  the  adaptive  algorithm 
of  section  3.1  can  be  applied  to  estimate  the  various  suboptimal  filter  para¬ 
meters.  In  each  case,  relating  the  parameters  to  the  state  noise  covariance 
provides  the  necessary  link  to  allow  the  adaptive  estimation  of  the 
parameters. 

For  the  data  age-weighting  technique,  we  can  equate  the  standard 
error  covariance  prediction  equation  (1,2.17)  to  the  age-weiqhted  error 
covariance  prediction  equation  (with  q  =  o)  : 

Pk+i  =  Pk  <t>T  +  r  q  \'T  =  <l>  s  P  <J>T  (3.r. 

Using  this  relation  in  (3.1.7)  gives 


Again  the  poit  hoc  modification  is  nude  r  <  y,  given  by  (3.1.1V)  or 

N 

is  utilized. 


Since  o*  is  required  to  be  greater  than  or  equal  to  u  (3.2.3)  may  not  be 
employed  until 

y„  >  //  <J>  P  $T  HT  +  R 

/V  — 

To  adaptively  estimate  the  gain  scaling  parameter,  />,  in  section  2.2, 
the  optimal  and  suboptimal  gain  expressions  are  equated. 

b  P'  HT(H  P'  HT  +  R r1  =  (P'  +  Q')  HT(U  P '  HT  +  H  Q'  HT  +  R)"1  ,  . 

where  p'  =  $  p  <t>T  and  £'  =  r  Q  VT.  Solving  for  H  Q'  HT  one  obtains,  after 
premultiplying  by  Ht 

b  H  P'  HT(H  P'  HT  +  H  Q'  HT  +  R)  «  H(P'  +  Q')  HT(H  P  HT  +  R) 

h  q'  ht  =  h  r  q  rT  ht  =  ---p— fi’-— (3.2.0; 

*  [/?  -  rt-j;  #  p'  ffr] 

Substituting  into  equation  (3.1.9), 

(b-1)  H  P'  HT(H  P'  HT  +  R)  m  -Up'  RT  _  R 
R  -  (b-1)  H  P'  HT  N 

which,  solved  for  b ,  yields 

(yN  -  R)  [R  *  p  *T  ht  +  R] 

b  «  — - — — -  >1  (3.2.7) 

H  $  P  $T  HT 

where  the  inequality  on  the  right  is  a  condition  on  b  established  in  section 
2.2.  Also  it  is  noted  that  equation  (3.2.7)  requires  yN  >  R,  a  condition  on 
its  use. 

The  modification  (2.2.11)  to  the  gain  scaling  technique  may  be  formu¬ 
lated  similarly  to  adaptively  estimate  its  associated  parameter,  a.  Equating 
(2.2.12)  with  the  gain  equation  gives 
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.  L~L{—  a  =  CP'  +  Q')  HT(H  P'  HT  +  K  Q'  HT  +  R)'1  (5.2.3) 

H  P'  HT 

which,  when  premultiplied  by  H  and  solved  for  H  Q'  HT  yields 

iiq.„t  =  3jLz  »-o)  !ULSl 

Substituting  into  (2.1.9)  and  solving  for  a  yields  the  simple  equation, 

a  =  1  -  R/y  <  2.2.  ;V 

N 

Conditions  on  a  require  that  (2.2.10)  produce  an  a  >  //  K. 

For  the  modification  given  by  (2.2.14)  the  appropriate  adaptive  form 
is  similarly  found.  Equating  the  gains, 

(P'  +  Q ')  HT(H  P'  HT  +  H  Q'  HT  +  R r1  = 

( P '  HT  +  B  R  P  HT/H  P'  HT)  (H  P'  HT  +  hf1  (2.2.  11) 

premultiplyinq  by  H  and  solving  for  //  Q'  flT  gives 

11  I‘ '  UT(11  P'  llT  +  R)  +  H  Q'  llT(H  P'  HT  +  R)  = 

11  P'  11T(H  P'  11T  +  R)  +  H  P'  11T  H  Q'  Hr  +  3  KOI  PA  KT  +  R)  +  3  .V  ll  Q'  Hr 

HQ'  llT  h‘(l  -  B)  =3  R(H  p'  llT  +  R) 

ii  q'  nr  =  h  r  q  rT  nr  =  h) 


(As  before,  p'  -  <j>  p  and  Q'  =  r  Q  rT.)  Substituting  into  (3.1.9)  and 
solving  for  3  one  obtains 

B  =  !  -  (H  P'  !{T  +  F)  =  l  -  f//  *  P  *r  hT  *  A’]  (  ’ 

yN  yN 

with  the  conditions  on  3  that  0  <_  3  ^  1  and  hence 

Y.,  >  H  $  P  4>T  HT  +  R 

N  — 


i.°) 
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Finally  we  note  that  for  the  additive  gain  term  technique,  a  similar 
procedure  to  determine  the  adaptive  form  yields  results  identical  with 
equation  (3.2.12).  This  is  true  because  equating  gains  and  premultiplying 
by  //  gives  H  3  ft  Hr/H  ftT  =  [i  ft,  and  from  then  on  the  equations  are  identical. 

The  previously  discussed  suboptimal  adaptive  algorithms  are  easily 
implemented  in  accordance  with  Figures  3.2,  3.3,  and  3.4.  Either  of  the 
predicted  residual  sample  means  discussed  in  section  3.1  may  be  used;  hence 
their  computation  is  not  shown  in  these  figures. 

3.3  Estimation  of  Auto-Correlated  Model  Errors 

One  of  the  most  effective  and  useful  ways  to  compensate  for  dynamic 
modeling  errors  is  to  estimate  such  biases  directly,  including  them  as  part 
of  the  state  vector.  In  general,  model  errors  are  not  purely  random,  but  have 
time  correlated  components  as  well.  Thus,  by  assuming  some  functional  form 
for  these  components,  their  values  may  be  estimated  from  observation  point 
to  observation  point. 

There  are  several  advantages  in  this  approach.  First,  of  course,  is 
that  the  estimation  accuracy  is  improved  since  model  errors  are  compensated. 
Second,  insight  into  the  nature  of  the  modeling  errors  is  obtained  as  an 
additional  benefit,  and  through  off-line  data  analysis,  the  form  of  the 
dynamic  model  can  be  refined  for  future  use.  Third,  since  the  state  vector 
is  simply  augmented,  the  estimation  equations  remain  essentially  unchanged 
from  the  classical  Kalman  sequential  form.  Of  course,  lengthening  the  state 
vector  increases  the  computational  load;  however,  this  is  not  often  a 
detriment  considering  the  state  of  present  computer  technology. 

A  further  motivation  for  the  adaptive  estimation  of  model  errors 
should  be  noted.  It  is  true  that  small  bounded  errors  may  be  effectively 
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compensated  (as  lonq  as  filter  operation  Is  within  the  range  of  linearity 
of  the  system  dynamics)  by  using  a  "state  noise"  approach,  l.e.,  simply 
determining  the  ^-matrix.  However,  for  large  or  unbounded  errors  such 
an  approach  is  usually  Impractical  If  not  Impossible.  Thus  extracting  as 
much  of  the  auto-correlated  error  as  possible  from  the  model  uncertainties 
results  In  a  smaller,  purely  random  component.  Consequently,  the  associated 
state  noise  covariance  matrix,  Q ,  Is  less,  and  since  this  establishes  a 
minimum  upon  the  state  error  covariance  matrix,  a  smaller  q  offers  greater 
confidence  in  the  state  estimate. 

The  question  concerning  the  type  of  functions  to  be  used  In  modeling 
the  correlated  errors  is  really  an  open  one.  The  answer  depends  upon  the 
application  where  two  important  considerations  are  the  filter  operating  time 
and  the  sampling  interval  (more  will  be  said  of  this  later).  Approaches  have 
Included  the  use  of  simple  power  series  In  time,  Tchebycheff  polynomials,  and 
Fourier  series.  For  our  purposes,  the  term  structure  Is  Introduced  to  mean 
the  functional  form  of  the  correlated  model  errors.*  Thus  the  structure  may 
be  11ne;r  or  nonlinear  according  to  the  form  of  the  model  error  approximating 
functions.  Incidentally,  the  case  of  purely  random  errors  Is  denoted  here 
as  a  null  structure  —  no  time  correlated  components.  Usually,  the  assumed 
error  form  is  some  linear  combination  of  a  linear  or  nonlinear  structure  and 
a  null  structure,  i.e.,  of  a  time  correlated  component  and  a  purely  random 
component. 

The  Incorporation  of  model  error  structures  into  the  estimation  process 
can  occur  In  basically  one  of  two  explicit  ways.  Denoting  the  vector  of 


*  Lainiotis  (38)  uses  structure  to  mean  the  dimensionality  of  the  state 
vector.  Our  definition  may  be  seen  to  include  this  aspect. 
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error  components  by  e ,  one  can  write  a  differential  form, 

eft)  =  gfe,  a,  t)  +  h(t)  u  (t) 

G 

aft)  =  0  (3.3.1) 

or  an  algebraic  form,  eft)  =  Gfe ^  a,  t)  +  Hft)  u  ft)  with  the  accompanying 

u  e 

differential  form 

eQ(t)  *  0 

aft)  -  0  (3.3.':' 

where  a  is  a  parameter  vector  of  constants,  and  eQ  is  the  Initial  condition 

vector,  eftj ,  and  w  is  state  noise  with  covariance  Qft).  u  ft)  Is  zero 
0  e  e 

mean,  uncorrelated  noise  with  covariance*  E{u  ft)  u  T(8)}  ~  Qjt)  6ft  -  s). 

e  e  s 

Which  form  Is  used  is  of  no  particular  Importance  in  the  subsequent  discussion. 
If  a  closed  form  solution,  or  even  an  approximate  solution,  is  available  for 
the  expected  value  of  (3.3.2),  an  easier  Implementation  may  be  possible  since 
the  corresponding  components  of  the  state  transition  matrix  are  simply  the 
Identity  matrix  of  appropriate  dimension.  If  no  explicit  solution  of  E{e } 

Is  available,  clearly  (3.3.1)  Is  to  be  used.  Further,  if  gfe,  a,  t)  is 
nonlinear  in  e  and  a,  an  appropriate  linearization  must  be  made  In  order  to 
use  the  estimation  equations  described  In  section  1.2. 

Since  the  model  errors  are  estimated  as  part  of  the  state  vector,  each 
time  the  estimate,  eft),  is  obtained  the  model  Is  updated.  Thus  If 
a  =  ffx,  e ,  t)  is  the  subvector  of  the  complete  state  vector  time  derivative. 


*  6(t  -  e)  is  che  Dirac  delta.  Although  is  not  differentiable  in 

the  classical  sense,  u  can  be  thought  of  as  the  "derivative"  of  u  ,  and 

e  e 

hence  as  an  uncorrelated  random  variable  with  Infinite  covariance.  A 
discussion  of  stochastic  calculus  is  given  in  (5). 
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•  A 

x(t)t  for  which  the  errors  are  to  be  Included,  then  a  *  f(£,  t)  represents 
the  updated  portion  of  the  model . 

Note  that  In  (3.3.1)  or  (3.3.2)  the  adaptive  nature  of  the  algorithm 
lies  In  the  fact  that  the  constant  parameter  vector,  a,  Is  estimated  as  well. 
Hence,  If  a  reasonably  accurate  structure  Is  assumed,  subsequent  estimation 
provides  accurate  values  of  a.  There  Is  a  danger  In  using  the  forms  as 
presented  In  (3.3.1)  and  (3.3.2).  As  the  estimation  process  Is  carried  out, 
the  a- vector  converges  to  a  constant  value  with  the  corresponding  error  co- 
variance  and  hence  gain  terms  approaching  zero.  Thus  If  the  assumed  structure 
Is  not  an  accurate  approximation  over  the  filter  operating  period,  the  eventual 
"hardening"  of  the  structure  as  a  takes  on  Its  constant  value  may,  and  usually 
will,  produce  divergence  of  the  estimate.  Since  one  either  does  not  really 
know  the  error  structure  or  else  It  Is  computationally  too  complex.  It  Is 
usually  a  good  Idea  to  model  the  a-vector  with  a  null  structure,  l.e.,  as  a 
random  parameter: 

aft)  -  u  t  F.{u  }  -  0t  E{u  (t)  u  T(t)}  -  Q  (t)  6(t-  -  T.)  (3.3.3) 

an  a  a  a 

This  keeps  the  error  covariance  from  vanishing,  and  the  gain  will  remain  at 
some  significant  non-zero  level  such  that  the  value  of  a(t)  Is  changed  to 
allow  the  assumed  structure  to  conform  more  accurately  to  the  actual  model 
errors . 

Before  discussing  some  specific  examples  of  the  structures  used  In 
the  Investigation,  It  will  be  advantageous  to  specifically  define  the  error 
vector,  e  (dimensioned  r),  and  parameter  vector,  a  (dimensioned  «) ,  with 
regard  to  the  previously  mentioned  forms.  The  differential  approach  is 
selected  so  that 

e(t)  =  g(e ,  a ,  t)  +  h(t)  ujt) 

a( t)  *  uj t)  3. ■ ' 

A 
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We  define  the  random  variable 


UT(t)  -  [ue(t)\ua(t)\ 


with  statistics 


li{u(t)}  =  0,  K{u(t)  u  (r))  -  i)i  t)  \(  I  -  r) 

For  the  cases  where  the  algebraic  form  is  to  be  employed,  ,•  (>i  is  simply 
Included  In  the  parameter  vector,  a.  The  form  (3.3.J)  will  be  used  as  a 
general  reference  In  the  sequel,  with  the  appropriate  modifi cations  for 
eQ(t)  being  understood.  Finally,  the  implementation  is  shown  in  Figures 
1.1  and  1.2  with  the  modification  that  the  state  vector  includes  the  compon¬ 
ents  e  and  a\  i.e. ; 


(3.3.5) 


xft)  13  redefined  to  be  eft) 

a(  t) 


where  the  x(t)  on  the  right  is  the  original  state  vector. 

A  Simple  Linear  Structure.  One  of  the  simplest  model  error  structures 
is  a  linear  function  of  time. 


eft)  “  °q  +  t 


(3.  3. 6) 


where 


Hence, 


t  a(  t) 


ujf t)  *  u( t) 


(3.  3.  7) 


and 


Note  that  without  the  presence  of  state  noise,  uQ  and  ujt  the  coefficient 
terms,  cQ  and  o  1%  will  take  on  nonvaryina  values,  and  the  resulting  "hardened 
linear  structure  will  probably  not  represent  the  model  errors  very  well 
(unless,  of  course,  the  error  actually  linear).  The  presence  of  the 
state  noise  keeps  the  gain  from  vanishing,  and  on  the  average  should 

approximate  the  model  errors  fairly  well  over  a  short  interval. 

Further  discussion  and  applications  of  this  model  are  given  in 
Chapters  4  and  5. 

The  Ornstein-Uhlenbeck  Stochastic  Process.  Another  simple  structure 
is  that  introduced  by  Ornstein  and  Uhlenbeck  as  a  model  to  describe  the 
velocity  of  a  particle  undergoing  a  Brownian  motion  (40,  516).  This  model 
has  been  used  successfully  by  Ingram  (32),  Tapley  and  Ingram  (41),  and 
Tapley  and  Hagar  (34),  (35),  in  orbit  determination  studies.  The  Ornstein- 
Uhlenbeck  process  obeys  a  simple  linear,  first  order  differential  equation 
(Langevin's  equation)  of  the  following  type: 
c(t)  + 

where*  a  is  a  vector  of  constant  parameters  and  u 'i)  is  Gaussian  white  noise 
The  process  is  stationary,  unbiased,  and,  by  solution  of  (.?.?.$),  is  expo¬ 
nentially  correlated  in  time.  Further  discussions  of  the  properties  of  this 
process  are  given  in  (5,  70-74)  and  (40,  516-524).  For  our  purposes 
application  to  adaptive  model  error  compensation  gives 


*  The  notation  t'aj  Implies  a  diagonal  matrix  whose  elements  are  the 


components  of  the  vector,  a. 
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eft)  =  eft)  =  -f'aj  eft)  +  u^ft) 

aft)  =  aft)  =  u  ft) 
a 

(3.2.10) 

Further  discussion  and  applications  of  this  model  are  given  In  Chapters  4 
and  5. 

Harmonically  Bound  Motion.  If  a  particle  undergoing  Brownian  motion 
Is  suddenly  subjected  to  a  displacement-proportional  force,  the  equation  of 
motion  Is  simply  that  of  a  randomly  forced  harmonic  oscillator  (40,  524-525): 

eft)  +  r  a  J  eft)  +  J  eft)  =  k  (3.3.11) 

where  a  and  3  are  constant  parameter  vectors,  and  u^ft)  Is  Gaussian  white 
noise.  This  process  Is  stationary,  unbiased,  and  has  autocorrelation  properties 
dependent  upon  the  signs  of  ["a  J  and  fs  J.  This  equation  has  been  employed 
successfully  as  a  model  error  compensation  structure  In  low  thrust  space 
vehicle  navigation  by  Tapley  and  Hagar,  (34)  and  (35).  Application  of  this 
structure  to  adaptive  error  compensation  gives 

n  ft) 

-  [“dj  n ft)  -  f  3  J  eft)  +  i<c(t) 
ujtn 

*  U  ft) 

V'-'J 


0 

ucn'> 

“of" 

V" 

Further  discussion  and  applications  of  this  model  are  given  In  Chapters  4 
and  5. 
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3.4  Linear  Transformation  of  the  Ornstein-Uhlenbeck  Process 

The  model  error  forms  presented  In  the  last  section  were  chosen  In 
an  apparently  arbitrary  fashion.  In  this  section,  a  unifying  derivation  Is 
presented  from  which  each  of  the  previous  models  can  be  obtained  under  appro 
prlate  assumptions.  The  general  result  Is  a  representation  shown  to  be 
equivalent  to  an  exponential  multiplying  a  Fourier  series. 

Consider  the  Ornstein-Uhlenbeck  process  of  equation  (3.3.9).  Let 
z(t)  be  a  (21  +  U-vector  related  to  the  r-vector,  c(t) ,  by  a  constant 
r  x  (21  +  l)  matrix,  C' ,  whose  values  may  be  complex. 


e( t)  =*  C'  e( t) 


(3.4.1) 


Further,  let  z(t)  satisfy  the  differential  equation  (3.3.9). 
component  of  e(t)  , 

21+1 


e^(  t) 


rml 


o'  e  (t) 
jm  m 


For  the  ,7-th 


21+1 

-  E 


T!T=1 


21+1 

-  E 


iml 


o'  le  [t  )  exp(  -a  t)  +  |  cxi'(  -a  (*)  u  (e)  d  el 
jm '  m  r  m  J '  r  m  m  * 

0 

IE  exP(  "a  t)  +  u)']  (3.4.2) 

jm  m  0  c  m  w 


where  the  a  are  also  allowed  to  be  complex.  Taking  the  expected  value,  the 

m 

second  term  In  brackets  vanishes.  By  absorbing  the  e(tj  Into  the  o'  ,  one 

o  jm 

then  obtains 

21+1 

E{e  (t)}  I  c*(t)  =  o  exp(  -a  t)  (3.4.3) 

J  J  >  "> 

with  a,  *  a'  e  (0).  This  is  the  general  form  of  the  transformation.  With 

jm  jm  m 

a.  and  a  complex,  the  only  restriction  Is  that  c  .*(t)  is  usually  required 

jm  m  j 

to  be  real  In  order  to  represent  real  model  errors.  If  a  Is  further 

m 

restricted  to  be  of  the  form 


<»  -»).#/  W  (•' 

m  ;  j 
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(5.4.4) 


where  i  =  /T7,  and  If  the  limits  on  the  summation  are  changed  to  run  from 
-l  to  +1,  (3.4.3)  becomes 


e*(t)  «  exp(  -Pj 


t) 


With  o  .  of  the  form 

jta 

vf 


'jm 


e  .  -  i  r. 

JJZ _ is. 

2  * 


a.  +  i  r. 

0  ,  Jl a _ Jul 

°  jm  2 


rrpB  - 1 


°)m  e*P<  ■ 


i  m  to.  t) 


m  >  0 


m  <  0 


then  equation  (3.4.5)  can  be  written  as 

l 

e/(t)  =  ex p(  -  p  t)  53  (a.  oo8(m  to.  t)  +  r.  sin(m  to.  t)} 
1  1  rrr-0  jm  *  j 


(3.4.5) 


(3.4.6) 


where  use  has  been  made  of  the  definitions 

cos  <p  *  [expd  $)  +  exp(-  i  <p)]/2,  sin  <j>  *  [expfi  $)  -  exp(-  i  <J ))]/2i 

Thus  under  the  foregoing  restrictions,  the  process  reduces  to  an  exponential 

multiplying  a  truncated  Fourier  series.  Some  special  cases  are  now  considered 

which  yield  the  models  of  section  3.3. 

Case  1.  Let  l  -  U  Pj  =  0%  and  using  the  series  expansion  for  sin  and 

nos  through  the  first  order,  (3.4.6)  becomes 

e  *( t)  +  rv  to.  t  (5.4.9) 

j  jl  jl  j 

which  becomes  the  linear  form  (3.5.0  upon  making  the  identifications 


°0  *  °jl  *  °0  ~ 

- 1 

...  {J 

Q 

_ 1 

J 

rjl  "  rjl  “ j  >  C*i  " 

m°um 

• 

• 

• 

L^rlJ 
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Case  2.  Let  l  =  0,  but  retain  the  exponential  form.  Then  'z.4.6j 

gives 

e^*(t)  •8jQexp(  -Pj  t)  (3.4.8) 

which  Is  the  solution  to  the  differential  equation 
e^*(t)  =  -Pj  ej*(t) 

This  Is  obviously  of  the  form  of  the  mean  of  (3. 3.t ))  with 
a.  =  Pj 

Case  3.  Let  1=1 ,  and  set  n  =  0.  Then  (3.4.6)  becomes 

-  j  o 

ej*(L)  =  exp(  -p_.  t)  [i!j1  /'on  Uj  t  +  r.ifi  u>_.  (3.4.6) 

which  Is  the  solution  to  the  harmonic  oscillator 


e^*(t)  +  2Pj  e^*(t)  +  (p2  -  uS)  Cj*(t)  *  0 

Making  the  Identifications 


2 


the  result  Is  of  the  form  of  the  mean  of  (3.3.11). 

It  should  be  apparent  that  more  complex  structures  can  be  obtained 
easily  from  (3.4.6).  For  example,  without  restricting  e  to  be  zero,  the 
resulting  form  of  (3.4.6)  with  7  =  1  satisfies  the  nonhomogeneous  differential 
equation. 


e^*(t)  +  Pj  ej*(t )  +  Uj2  e*(t)  +  u^2  8 cxp(  -Pj  t)  =  0 


3.5  Structural  Adaptation 

With  the  general  form  (3.4.6)  established  as  being  capable  of  repre¬ 
senting  a  number  of  different  functions,  it  Is  possible,  at  least  In  theory, 
to  adaot  structurally  to  a  close  approximation  of  the  dynamic  model  errors. 
By  defining  the  vector  of  model  errors  and  parameters. 


\ 
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z(  t) 


cjt) 

e  it) 

m 

P  j(t) 

pit) 

m 

t) 

• 

0)  (t) 
m 


and  selecting  m  to  be  large  enough  (based  on  the  degree  of  sophistication  of 
the  a  pfuofu  dynamic  model),  the  filtering  equations  (Algorithms  I  or  II, 
Chapter  1)  will  produce  values  for  the  and  uk  which  will  result  in  an 
approximation,  e_.t  to  the  true  structure  of  the  model  error*.  The  obvious 
disadvantage  Is  In  the  Implementation:  a  large  m  produces  a  very  significant 
computational  load.  Thus  the  flexibility  for  structural  adaptation,  as 
offered  by  a  form  such  as  (3.4. 6),  Is  partially  offset  by  the  need  to  exercise 
parexic  judgment  in  selecting  a  value  of  m. 


3.6  A  Sequential  State  Noise  Covariance  Estimator 

In  this  section  we  return  to  the  problem  of  estimating  the  state  noise 
covariance  matrix.  A  new  approach  Is  taken,  although  the  technique  Is  simply 
the  formulation  of  a  Kalman  filter  algorithm  applied  to  a  vector  whose 
elements  are  those  of  the  state  noise  covariance  matrix,  Q.  With  this  ob¬ 
jective  In  mind,  once  we  have  obtained  the  analogous  state-observation  and 
state  dynamics  linear  relationships,  the  appropriate  estimation  equations  can 
be  written  down  Immediately.  The  resulting  method  will  be  shown  to  yield  a 
minimum  variance  estimate  of  the  state  noise  covariance  matrix  elements, 
subject  to  the  additional  restriction  that  Q  be  non-negative  definite. 
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We  begin  by  establishing  the  square  of  the  fcth  predicted  residual  as 
the  required  observation.  From  section  3.1,  the  predicted  residual  Is 
defined  as 


-  **  ** 


"  Hk  *k,k-l  */r-l 


H.  ♦.  .  x.  +  H  r.  .  .  w.  .  *  v. 

JC  /C  /C  y/C* *  A™ X  A 


Continuing  with  the  previous  assumption  that  we  are  dealing  with  scalar 
observations,  we  then  form 


El\  -/>  -  -  Hk  pk  Hk  *  Bk 


and  define  the  error,  e,  as 


(3.6.?) 


\  ■  V  ■  E(rk] 

where  £{6}  »  0.  Then 

V  *  "k  pk V  +  Rk*\ 

Further,  assume  that  Rk  Is  exactly  known  and  define 

w-** 

vk-Hk\Hk 


(3.6.3) 


(3.6.4) 


(3.6.3) 

(3.6.6) 


so  that 


DkmVk  +  \ 


(3.6.7) 


Now  Vk  Is  a  function  of  $  ,  since 

P  «  a  P  a  T  *  r  q  rr 

k  k,k-l  k-1  k,k-l  k,k-l  **-I  k,k-l 


(3.6.8) 


and  It  Is  that  we  are  trying  to  estimate.  If  a  Kalman  filtering  algo¬ 
rithm  Is  to  be  developed,  then  a  linear  equation  relating  to  r  must 
be  obtained.  To  this  end,  the  elements  of  Q  are  first  placed  In  a  vector,  j. 
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such  that  the  elements  are  stored  row-wise.  Further,  use  Is  made  of  the 
fact  that  Q  Is  symmetric  so  that  only  the  upper  triangular  portion  Is  stored. 
Thus, 

T 

Q  *  . .  <  *  *  i  (7  am  *  *  ■  £7  m  C7  . .  i*i  ^  .  q  )  ( 3 .  6 . 9 ) 

^  n 11  ^ lm  ^22  ^2m  ^33  ^ m-l,m  ^mm 

where  m  *  m  Is  the  dimension  of  Q ,  and  qk_1  *  q(tk_ })  Is  therefore  the 

m(rrH-l)/2  x  l  state  vector  to  be  estimated.  Proceeding  formally,  consider  a 

Taylor  series  expansion  for  v  expanded  about  the  value,  q  =  o%  and 

A  ft"  2 

truncated  to  the  linear  term: 


vw 


3r. 


Vi 


aVi 


lk-l 


qic- 


k-1 


(  3.6.10) 


Vk(0 )  Is  simply  (3.6.6)  with  Qk_1  =  0.  In  analogy  with  the  linearized  observa¬ 
tion  state  equation  (1.2.23),  we  use  (3.6.10)  and  write  (3.6.7)  as 

+  \  ,3-6AV 


where 


-  V.  (0) 

k 


H. 


dVk 

J.  =  3~~ 


1 k-1 


(3.6.17) 


The  final  task  In  developing  the  components  of  the  observation  equation  Is  to 
determine  the  elements  of  Jk.  Since  P^,  and  hence  Is  linear  In  , 
equation  (3.6.10)  Is  exact;  we  write  (3.6.10) 

vi /w  “  vw  +  ivkt0> 

The  expansion  of  vk  with  respect  to  Qk_1  yields 


(3.6.13) 
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6r//*  \  Hk> 


h .  r 


6Q 


H. 


k  k,k-l  k-1  k,k-l  k 


Hk  ^ k,k-l  ®k-l 


T 

,k-l 


(3.6.14) 


where  -  GJ_2  -  Qk_J  since  the  expansion  Is  about  the  nominal  value 

Q*_2  -  ft.  Now  the  key  factor  In  the  development  hinges  on  the  ability  to 
express  the  elements  of  H  Q  ht  In  the  form 

J  qT  ( 3.6.15 1 


where  J  Is  composed  of  the  elements  of  H  and  r.  To  this  end  we  digress  for 
a  moment  and  prove  the  following: 

Theorem  I.  Given  the  matrices  A,  Bt  c  and  X  of  dimensions  7  x 
m  x  n,  nl  x  m2 ,  and  m  x  m  respectively,  then  the  l  *  n  matrix,  AXBt  can  be 
written  In  the  vector  form,  Cx%  where  xT  =  (x,.  x,_...x,  x x„,.  ..x,  ...x  )t 

11  12  lm  22  23  2m  mm 

an  m2- vector,  and  c  =  {<?  }  with  the  being  given  by 

°ij  =  a\(i-l)/n+l\,\(j-l)/m+l\  b[mod(j-l,m)  +  l)\3[mod(i-ltn)  +  1] 

where 

Is  I  *  integer  part  of  s 

and 

mod(83  t)  =  8  -  |j|  t 


Proof.  Writing  axb  In  expanded  form  results  in  the  following 


sequence: 


11 


lm 


CL*  •  •  i  •  CL 
11  m 


CL  -  .  •••  CL  . 

11  lm 


all  alm 


x 1 -  ...  x. 
11  lm 


x  -  ...  x 
ml  mm 


m 

l  xlj  bjl 
3 


rn 

J  X  .  b  , 

;  mj  Jl 

tl 


b , ,  ...  b  , 
21  in 


fc  ,  ...  b 
ml  mn 


m 


.  1  x ,  j  b  . 

j  lj  Jn 


m 

.  I  x  .  h 
J  mj 


jn 


AXB  = 
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m  m 

?  au  l  xu  hji 

i  ./ 


m  m 

.  .  y  a.,  y  i. . i), 

h  li  L.  i]  in 

l  3 


AXH  = 


m  m 

l  au  l  -a  bji 

x  J 


m  m 

•  •  y  ■  y  x . .  /’ . 

L.  h  L.  i  j  in 
i  J 


(3.6.16) 


The  general  element  of  AXB  Is  further  expanded  as 

tn  m 


X 

b 

»• 

lp 

11 

kl 

2p 

'12 

x 

l  a 

h  . 

J* 

lv 

21 

k2 

2  p 

JP 

l> 

x  , 

k  - 

1  ,m 

km 

mp 

mm 

a,  ,  r  x. 
kl  mp  Im 


mp  ?p 


In  vector  form  this  becomes 

m  m 

J"  y  a.  .  b  .  x.  .  =  (a,  , 

L.  h  ki  jp  i  j  kl 

t  j 


y  y  a,  .  b  .  x .  .  =  (a,  ,  h.  it.  ,  />  ...  a,  I  )  x , 

h  h  ki  jp  i  j  kl  lp  kl  2p  km  mp  12 


(3.6.1  ' 


Arranging  the  elements  of  (3.6.10  Into  a  column  vector  and  using  (3.6.  r) 
results  in 


m  m 

y  y  a  .  b  .  x.  . 
i  j  11  3*  >3 


m  m 

l  l  ali  bjn  xi  1 


m  m 

1 1  au  bn  xn 

0 


(1  m  f  B  _  -  .  .  .  (2  _  - 

11  11  lm  ml 


a„  ii.  . 

11  In 


all  bll  * 


.  a ,  h 
lm  mn 


•  a,  o  , 
lm  ml 


1  I  b.  x..  a7,  b.  .  .  .  a,  b 

L.  h  li  jn  ij  ll  In  lm  mn 


(3.6. 18) 


100 


Defining  the  elements  In  the  nl  *  m2  matrix  on  the  right  as  the  elements  of 
c  completes  the  proof. 

In  view  of  this  theorem  we  also  have  the  following: 

Corollary.  Given  the  conditions  of  the  Theorem  I  above,  let  X  and 
AXB  be  symmetric.  Then  AXB  may  be  placed  In  the  vector  form  c.v  where 


t  , 
x  «  (x 


11 


...  X ,  £„„ 

lm  22 


...  x„  ...  x  i  Is  a  vector  of  m(nn-i)/2  elements  and 
2m  mm 


c  Is  an  ln(ln+l)/2  x  m(m+l)/2  matrix  with  elements 


a  b  ,  for  q 
pq  rs  J  n 


a  b  +  a  b  >  for  q  ^  r 
pq  rs  pr  qs  w  ^ 


(3.6.19) 


and  where 


p  *  I (i+t~l)/n  +  1 1 
q  -  \(j+u-l)/m  +  1\ 
r  -  [ mod(j+u-l ,  m)  +  1] 
a  =  [ mod(i+t-l t  n)  +  1] 
t  =  \i/n\  and  u  =  |j/ml 

The  equations  for  the  Indeces  of  the  terms  in  Theorem  I  and  the  corollary 
follow  by  Induction. 

Since  X  Is  symmetric,  x^  =  x ^  ,  i  t  j.  Then  the  redundant  elements 
of  x  may  be  eliminated  so  that  x  =  (x.,...x,  x„„.  ..x„  x__...x-  ...x  ) 

11  lm  22  2m  33  3m  mm 

is  m(ml)/2.  In  Cx  the  elements  of  c  corresponding  to  x.^t  for  i  >  j,  are 
simply  added  to  those  corresponding  to  x^,  for  i  <  J.  Further,  the  synmetry 
of  AXH  yields 


mm  mm 

I  I  a^j  x-j^j  =  l  l  a  j  xj  •  b  .. 

H  H  ki  ij  jp  H  H  pi  ij  jk 

*  U  *  d 

hence  the  duplicated  rows  of  c  need  not  be  Included.  As  a  result,  the 
elements  of  c  are  given  by  (3.6.19). 
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In  light  of  this  corollary,  the  validity  of  (3.6.15)  is  established. 
A  simple  example  will  serve  to  Illustrate  the  application.  Consider  a  2*2 
<?-matrix,  with  ht  as  a  3- vector.  Then 


qn  qi2 
q2i  q 22 


H  =  l h2  h3 ) 


'll  Y 12 
y21  y22 
y31  y  32 


and 


HT  *  [hz  h2  h3 ] 


yil  y 12 
y  21  y  22 
y  31  y  32 


[f  hi  Yu  f  hi  yjj] 


(3.6.20 


Making  the  Identifications 

A  =  bt  -  //  r,  X  «  Q 


then  application  of  the  corollary  to  Theorem  I  yields 
xT  5  ,T  -  <qu  qu  q22) 

and 

C  5  J  ■  [(  \hi  Yi  l)‘  2{\Hi  YJi)(^  hi  Yi2)  (f  hi  Yi2 

Note  that  in  cases  where  J  has  only  cne  row  (as  In  this  example),  then  we  can 
write  the  equaLity 

H  T  Q  rT  HT  =  .1  q  (3.6.2V 

This  example  corresponds  to  the  case  of  scalar  observations.  In  line  with 
our  previous  developments  we  continue  with  this  assumption,  although  It  Is 
not  a  restriction. 
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With  the  validity  of  (3.6.11)  established  and  Its  elements  determined.  It 
Is  Important  to  consider  the  variance  of  This  Is  necessary  as  the  Inverse 
of  this  variance  and  some  of  Its  elements  appear  In  the  filter  gain. 

Defining 

&k  ~  J k  +  ®k  ~  k^k-1  ~  ^k-1^  +  (3.6. 

and 

V>  -  *«*’> 

then 

- J*  £{Vi  vTi>  V +  \  E,vi  V 

♦  s{et  1^}  JkT  t  £(etJ>  is.e.rs) 


Letting 


r* 


) 


-  =  r(Vj  01 

and 

rk .  mkn 

then  ( 3.6.23 )  becomes 

G  =  J  S  JT  +  ?.  J  C  +  7 


(t.e.r-;) 


(3.6.25) 


In  the  ordinary  Kalman  filter  the  observation  error  Is  assumed  to  be  uncor¬ 
related  with  the  a  piiorU  state  estimate.  Here  this  case  is  not  generally 
true;  both  0  and  q  are  functions  of  u,  and  this  correlation  is  reflected  in 
c.  However,  it  is  difficult  at  best  to  determine  appropriate  values  for  the 
elements  of  c.  Thus,  in  view  of  this  lack  of  knowledge,  a  viable  alternative 
is  simple  to  assume  c  -  o  so  that  (3.6. 24)  becomes 
G  =  J  S  JT  +  7 
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This  expression  Is  directly  analogous  to  the  observation  residual  variance  of 
the  ordinary  Kalman  filter. 

We  now  lack  only  a  "dynamics"  relationship  governing  the  prediction  of 
q.  Assume  that  q  is  linearly  related  to  q  by  an  appropriate  state 
transition  matrix,  y(t,  t,  J: 

k-1  k-2 

qk-l  *  ^ k-l, k-2  qk-2  (3.6.26) 

If  the  process  generating  the  elements  of  state  noise  vector,  u,  is  wide-sense 
stationary,  then  the  state  noise  covariance  matrix  is  constant.  Making  this 
assumption  we  have 


qk-l  *  qk-2 

and  hence  T  Is  the  identity  matrix,  r.  Such  an  assumption  may  not  be  overly 
restrictive  since  usually  the  dynamic  model  governing  x  can  be  determined  such 
that  u  Is  at  least  bounded  over  the  interval  of  interest,  t  <:  t  <  tf. 

We  are  now  In  a  position  to  write  down  the  estimation  equations. 

Making  use  of  the  definitions  (3.6.24)  and  employing  the  assumption  c  =  0,  the 
appropriate  relationships  may  be  written  directly  as 


Sk  ~  Sk-1  *  qk-l  “  qk-2 


(3.6.27a) 


Mk  =  Sk  Sk  +  Tk) 


(3.6.27b) 


qk-l  =  qk-l  +  ^k^k  ~  k  qk-l ^ 

sk=(I-  Mk  Jk>  V7  -  MkT)  +  Mk  \  \T 


(3.  6.  2  "a) 


(  3.6. 77 d) 


where  Is  the  filter  gain,  and 

sk  * 

There  are  additional  restrictions  which  must  be  set  forth  before  the  algorithm 
may  be  used.  Recall  that  q  must  be  non-negative  definite.  However,  it  is 
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not  unlikely  that  (3.6.27a)  may  produce  an  estimate  q  which  does  not  conform 
to  this  requirement,  particularly  during  Initial  (transient)  operation  of 
the  filter.  In  order  to  accommodate  the  non-negative  definite  constraint 
we  call  upon  the  properties  of  the  covariance,  Q.  First,  recall  that  for 
the  diagonal  elements,  <?ii, 

qn>0  (2.6.29) 


Also,  the  ,  i  O',  may  be  written  in  terms  of  the  correlation  coefficient, 


v as 


qij  ■  p«  /<?ji  *7) 


(2.6.30 


where 


-2  £  Pij  £  1 

Now  since  (3.6.29)  and  (3.6.30)  are  Inherent  properties  of  a  covariance 
matrix,  and  since  any  covariance  matrix  must  be  non-negative  definite, 
(3.6.29)  and  (3.6.30)  may  be  used  to  insure  the  restriction  on  Q  Is  met. 

Thus  If  any  diagonal  element  Is  estimated  to  be  negative,  It  Is  then  set 
to  zero.  Correspondingly  the  appropriate  gain  element  Is  recomputed  to  agree 
with  this  modification: 

0  «  q ii  -f  m( A  -  J  q) 
or 

m  m  -qn/(I  -  J  q)  (3.6.31) 

where  m  Is  the  corresponding  element  of  After  performing  this  test,  if 


Wijl  >  T}, 

then  set 


(3.6.20 


(3.6.32) 


where  the  superscript,  m,  means  the  value  computed  by  (3.6.2b),  and  where 
the  max  and  min  functions  mean  take  the  respective  maximum  or  minimum  values 
of  the  arquments.  Further,  the  corresponding  gain  term  Is  modified 
as 

m  =  (StJ  -  q.  .)/(!  -  J  q)  (3.6.14) 

* ij  U 

The  foregoing  restrictions  do  not  destroy  the  unbiased  property  of  the 
estimate,  i.e.,  E{q)  =  q.  Further,  since  the  Kalman  filter  normally  produces 
a  minimum  variance  estimate,  the  restrictions  on  q  result  in  a  constrained 
minimum  variance  estimate.  Of  course,  this  is  true  only  within  the  validity 
of  the  assumption  £{<78}  =  0. 

No  mention  has  yet  been  made  as  to  the  choice  of  the  residual  error 
variance,  T.  Since  It  Is  somewhat  difficult  to  select  this  value,  it  may  be 
estimated  also.  One  way  Is  to  use  a  method  proposed  by  Tapley  and  Born  (43). 
The  technique  Is  simple  and  straightforward,  and  is  the  average  of  the  a 
poAtzfuofU  residuals  given  by  the  following  recursive  equation: 


T 


Ac 


,T 


k 


( >  t1 .  33 ) 


An  initial  value  which  may  be  used  Is  ?  =  A^?,  since  if  q  7  =  6, 

P  =  0Z  =  T 


Since  (3.6.36)  Is  the  average  of  all  the  residuals  up  through  as  k 
becomes  larqer,  each  new  residual  has  less  effect  in  determining  7.  This  is 

Ac 

acceptable  If  0  is  wide-sense  stationary.  In  which  case  r  is  constant.  If 

this  is  not  the  case,  then  a  modification  which  offsets  this  effect  is  to 

use  only  the  last  n  residuals.  Thus  for  k  <_  .v,  equation  (3.6.33)  is  employed. 

At  each  t.  the  fcth  residual  Is  saved.  Then  for  k  >  N  use 
Ac 


‘Ac-2 


J. 


/v  1  \ 

qk-l} 


"  N  (^k-N  “  l  k~N  k-N-1 


(3.6.  36) 


Instead  of  (3.6.33),  and  continue  to  save  the  most  current  N  residuals. 
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The  Implementation  of  the  sequential  state  noise  covariance  estimation 
algorithm  Is  diagrammed  In  Figure  3.5. 

Although  we  have  been  concerned  here  with  estimating  only  the  elements 
of  Qt  It  should  be  obvious  that  the  algorithm  may  be  extended  to  Include  the 
simultaneous  estimation  of  the  observation  error  variance  /?.  In  this  case, 
an  augmented  state  vector,  a,  is  defined  as 
«  2  (q  /? )T 

Then  A  and  J  are  redefined  as 

A  =  -  n  T  llT  =  ./  ;;  +  0 

and 

<!  =  (G  1)  ( 

The  corresponding  estimation  equations  are  of  the  same  form,  although  r  is 
replaced  by  r2,  and  a  replaces  q. 

It  was  indicated  earlier  that  the  assumption  of  scalar  observations 
Is  not  a  restriction.  Consider  the  case  where  //r  is  1  x  m.  Then  tiVJYTHT 
Is  Z  x  /..  Correspondingly, 

r~rr  =  fI4>r$THT  +  H  V  VT  HT  +  /,'  +  0  .  ,  ' 

is  also  l  x  Z.  Since  the  terms  of  C6.6.S8)  are  symmetric,  the  corollary  to 
Theorem  I  applies,  and  we  can  write 

*r2  =  *U1  <t>  P  4>r  UT)  +  J  q  +  *P  +  *B 


where  each  of  the  terms  is  l  ( l  +  V/2  x  i.  For  instance.  If  Z  *  r, 

*r2  =  (r}2  rlr2  r22;r  with  similar  arrangements  for  *(H$F$rHT) ,  *Et  and  *9. 


Thus, 


11 
712 
7 22 


*(H  HT)  =  *V(0 )  * 


10? 


Figure  3.5  State  Noise  Covariance  Sequential  Estimator 
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*/?=  /? 


(3.6.40) 


The  rest  of  the  algorithm  is  obtained  in  a  straightforward  manner,  similar  to 
that  for  Algorithm  I  (Chapter  1),  subject  to  the  non-negative  definite 
restrictions  for  q. 

Finally  we  note  that  the  estimator  developed  above  suffers  certain 
disadvantages.  One  of  the  more  serious  is  the  assumption  of  a  specific  model 
for  Q.  This  may  be  reasonable  for  a  portion  of  the  filter  operating  range. 
However,  if  the  state  noise  process,  u,  changes  its  behavior  drastically  after 
steady  state  conditions  have  been  essentially  reached,  the  filter  has  no 
power  to  adapt  to  the  corresponding  new  value  of  J.  Recall  that  this  Is  due 
to  the  fact  that  the  associated  error  covariance  practically  vanishes,  and, 
similarly,  so  does  the  corresponding  gain.  This  situation  is  exactly  analogous 
to  the  original  dynamic  model  error  problem,  and  is  the  most  serious  disadvan¬ 
tage  of  this  ^-estimator.  However,  this  being  the  case,  we  can  apply  any  of 
the  previously  developed  compensation  algorithms,  including  another  state 
noise  covariance  sequential  estimator. 

For  example,  an  obvious  method  Is  to  include  a  state  noise  fourth 
moment  matrix*,  w.  In  this  way  the  prediction  of  s  given  by  (3.6.27a)  is 


*  By  using  another  filter  to  estimate  W,  this  approach  could  be  continued 
ad  infinitum,  becoming  computationally  very  burdensome  very  quickly.  The 
author’s  opinion  is  that  anything  much  more  complex  than  (3.6.41)  is  seldom 
practical. 
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modified  to 

-  Vi  *  Vi  ,3-e-41> 

3.7  Summary 

This  chaoter  has  presented  a  selection  of  adaptive  optimal  and  sub- 
optimal  algorithms  to  compensate  for  model  errors.  In  all  cases  the  emphasis 
has  been  on  simplicity  and  generality.  Jazwinski's  algorithm  for  estimating 
the  state  noise  covariance  matrix  has  been  shown  to  be  optimal  based  on  a 
single  predicted  residual.  By  using  the  mean  of  a  sample  of  squared  predicted 
residuals,  an  element  of  smoothing  Is  Introduced,  but  also  at  the  expense  of 
Introducing  suboptlmallty . 

The  basis  of  Jazwlnskl's  algorithm  has  also  been  shown  to  be  applicable 
to  a  number  of  the  algorithms  of  Chapter  2,  allowing  the  suboptlmal  parameters 
to  be  adaptively  determined. 

In  section  3.3  the  approach  (different  from  all  the  previous  ones)  of 
attempting  to  estimate  the  actual  model  errors  has  been  taken.  This  adaptive 
method  has  been  shown  to  require  the  assumption  of  a  particular  functional 
form  to  represent  the  structure  of  the  model  error.  Several  examples  have 
been  presented,  and  a  general  functional  form  developed  to  allow  a  measure 
of  structural  adaptation.  This  has  been  In  the  form  of  a  generalized 
Ornsteln-Uhlenbeck  process,  capable  of  structurally  approximating  (having 
the  approximate  functional  form  of)  a  number  of  different  functions. 

A  Kalman  filtering  algorithm  for  sequentially  estimating  the  state 
noise  covariance  matrix,  <,?,  has  been  developed  in  section  3.6.  The  algorithm 
Is  somewhat  more  complex  than  those  previously  presented,  but  has  been  shown 
to  provide  a  minimum  variance  estimate  of  v,  subject  to  the  constraint  that 
Q  be  non-negative  definite. 


Chapter  4 


APPLICATIONS:  RENDEZVOUS  MISSION 

4.1  Introduction 

In  this  chapter  the  algorithms  presented  In  the  foregoing  chapters 
are  applied  to  the  rendezvous  problem  formulated  in  Section  1.4.  The  Interest 
here  Is  to  gain  familiarity  with  the  operation  of  the  various  estimation 
algorithms.  Through  the  development  of  explicit  equations,  and  plots  of  their 
behavior,  qualitative  insight  may  be  gained  into  the  performance  of  the 
algorithms. 


**  *  *  */*> 
A  “  " 

bk  =  b  +  Klb  -  p] 


Pk  *  s  [J  -  K]  P  (4.2.1) 

where  It  Is  tacitly  assumed  that  r  Is  constant.  Starting  with  P(tQ)  «  PQ, 
the  gain  and  covariance  recursions  above  may  be  applied  to  arrive  at  the 


Ill 


steady  state  gain  and  covariance  values*.  Thus, 

k  =  1: 

p  =  p 

1  0 

K1  ‘  P1IP1  *  R/ei'1 

PJ> 

*  (PQ  +  R/e) 

'‘l  =  1  \(VQ  +  H/b)]  ‘  o 

R  PQ 

(PQ  +  R/e) 

k  *  2: 

po  R 

P2  P1  (PQ  +  H/b) 

K2  =  P2lP2  *  R/a]~1 

J‘Q  H/(VQ  +  U/tt) 

3  P^Ji 

(I iQ  +  H/b)  *  R/n 

r  V«^/°2  ]  p0« 

2  \_P  (1  +  1/a)  +  R/e* j  (PQ  +  H/b) 

*  po  R/lpo(1  +  1/8 }  +  R/e2] 


1 


*  The  gain  and  covariance  update  equations  could  be  used  to  solve 
directly  for  the  steady  state  conditions.  However,  using  the  recursion 
equations  to  compute  values  recursively  and  then  inductively  determining  the 
steady  state  conditions  provide  insight  into  the  operation  of  the  algorithm. 
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P 3  M  P 2  ~  PQ(1  +  1/e)  +  R/s2 

K3  «  P3[P3  +  R/eV1 

P0  R/{p0n  *  1/o)  +  R/a*] 

*  PQ  R/[P  (1  +  l/s)  +  R/e 3  ]  +  R'n 

ro _  'o  _ 

"  VQ  +  l’0(l/n  +  1/a2)  +  R/a 3  -  V  (l  +  1/a  +  la-'  4  .7 


P3  “  P0  R/lPo(1  +  1/8  +  U°2)  +  H/" ‘ 1 

Continuing  the  recursion,  one  obtains  for  k  =  m: 


m  m~l 

p0  £  VsJ  +  R/sm 

J-0 


poR 


m  m-1  . 

pn  £  l/ej  +  R/sm 

°  3-0 

The  summation  In  K  and  p^  is  simply  a  geometric  series  with  sum,  as 


£  i/*j 


3-0 


1 

1  -  1/8 


Thus,  as  m  -*■  ®,  k  and  P  become 

m  m 


where  lim  R/em  *  0,  since  a  >  2,  and 


a  -  2 


/? 


\ 


(4.2.2) 

n  », 

rtf.  2. 3; 

(4.2.4) 


a 


(4.2.5) 


113 


For  very  large  k ,  the  estimate  of  p,  including  the  first  order  model  error 
effects,  is  found  to  be  approximately 

*  S'7~  ,0oy sinh  ytk  *  '“o  aoah  ytk  -  K  *  V 

or 


K  "  i  K  *  (,v  >'k 1 >\  '  v 

Using  (1.4.25),  the  corresponding  error  is  given  approximately  by  the 
following  expression 

i!  -  1 


P*  “  a  <V  V-k  *  Pp  «*>«>»  l  P 


a 


r-i.i*.  r; 


For  large  e,  the  effect  of  the  unbounded  error  term  (first  term  on  the  right) 
Is  small.  Correspondingly,  the  random  effects  are  more  predominant,  and  in 
the  limit  as  a  •+■  °°  the  error  is  due  strictly  to  measurement  noise.  Because 
the  time  correlated  term  completely  dominates  the  noise  term  when  k  is  large, 
continued  filter  operation  is  possible  only  when  a  Is  very  large.  In  fact, 
best  performance  results  here  when  e  is  very  large  since  then  all  the 
information  Is  derived  from  the  most  current  observations.  As  expected 
(section  2.1)  the  error  covariance  in  this  case  becomes,  by  (4.2.5) ,  E. 


4.3  Schmidt  Suboptimal  Filter:  Gain  Scaling 

In  section  2.2  It  was  Indicated  that  the  use  of  a  constant  value  of 
b  can  lead  to  certain  difficulties.  This  Is  exemplified  here  when  one 
attempts  to  find  the  steady  state  gain  and  error  variance.  The  error  co- 
variance  recursion  is  used  to  solve  for  the  steady  state  variance  directly: 


- <zb  -  bl>  rh 


1  P, 


m 


In  steady  state,  P  -  PM  »  P. 

'  it j  w 


P=[l-  (2b-  b2)  3  P 


-(2b  -  b2) 


P  +  R 


(4.3.8) 


This  Implies  either  Pm  *  0  or  b  *  2.  Thus  for  l  <  b  <  2,  It  simply  takes 
longer  for  the  steady  state  condition  of  r  -  0  (and  hence  K  =  o)  to  be  reached. 
For  b  *  2,  P  *  P0>  In  this  case,  the  estimate  eventually  diverges. 

For  the  method  where  (2.2.12)  and  (2.2.13)  are  employed,  the  steady 
state  variance,  P  ,  is  found  not  to  vanish,  since  in  this  case  r  is  found  to 

CO  oo 


00  2  -  a 


(  t .  t> .  8 ) 


The  corresponding  gain  is  simply 


K  «  a 

oo 


(4.3.10 


As  a  lt  only  the  most  recent  observations  contribute  Information.  Con¬ 
sequently,  ->  1  and  P w  -*■  r. 

The  corresponding  estimate  and  error  for  large  k  are  determined  as 

p*  + 1  f  v  +  p» aoeh  -  K  *  V 


n  -  T)  h*T;  'V  Yf*  *  '°o  V'k  *  V 


t  id 


(1  -  f  )  (P0Y  sink  ytR  +  pQ  cosh  ytR)  -  (1  -  |  ^  i* 


n.s. 12) 


When  b  is  determined  by  equation  (2.2.14) ,  the  steady  state  gain  and  variance 
are  also  seen  not  to  disappear.  Using  the  covariance  recursion,  one  finds 
for  the  steady  state, 
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Zb  -  b1 


2&R  B2  R* 
P  "  ~P 


p  =  [i  -  n  -  b2  p7p2;  (p  l  R)  i  p 


*  [p  -  (1 


32  P2  .  P2  . 

p?  '  rp  +  p; J 


.  rp  +  b2  h) 
1  p  +  r< 


Solving  for  P,  one  finds  that 

P«  M 

For  the  steady  state  gain, 


K'  =  bK  =  (1  +  B  | ; 


P  '  P  +  R 

which  yields,  on  substituting  for  p  from  (4.3.13)  t 


K'  =  bK 


2  B 
B  +  1 


(4.3,  13) 


(4.3.  ID 


When  8*0,  one  has  the  optimal  Kalman  filter,  and  the  steady  state  values 
of  gain  and  variance  are  zero.  For  B  *  1*  P  -  R  and  K'  «  1,  as  expected. 

For  very  large  k,  the  estimate  of  p.  Including  the  first  order  model 
error  effects.  Is  found  to  be  approximately 

K  *  K  *  rh  ‘v alnh  ytk  * p  aoah  Yt*  -  K +  v 

or 

K  »  f-j-f  P*  +  bVj  1poy  sin*  Yt*  +  pp  aosh  Yt*  +  ”*>  N- *■ 16> 

The  corresponding  error  Is  found  to  be  approximately 
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K  ’  frt  |(v  Hnh  ytk  *  po  °°,h  Y‘*]  -  frl p*  -  g¥r  "k 

(4.2.16) 

Again  the  results  are  as  expected.  For  3  *  o,  the  error  consists  of  only  the 
time  varying  term.  For  6  *  J,  the  error  Is  due  solely  to  the  measurement 
noise. 


4.4  Schmidt  Suboptlmal  Filter:  Additive  Gain  Term 

Using  the  same  procedures  as  in  section  4.3  for  />  =  ;  +  3  v  '/\ 
Identical  results  are  obtained  for  the  steady  gain  and  variance.  Using 
equation  (2.2.6) , 


P  =  (1 


P 

P  +  R 


)  P  + 


32  R2 
P  +  R 


(P  +  g 2  R)  R 
P  +  R 


or 


P  =  3  R 


(4.4.1) 


For  the  gain, 

P  +  R  P  +  R 

or  using  p  =  B  /?, 


i,  m  2  B 

M  B  +  1  (4.4.Z) 

It  Is  noted  that  the  difference  between  the  two  algorithms,  discussed  in 
section  2.3,  does  not  appear  here.  In  other  words,  for  the  gain  scaling 
suboptlmal  term,  the  variance  term  divides  out,  yielding  the  Identical 
results  In  gain  and  variance. 
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4.5  Limited  Memory  Filter 

To  apply  the  limited  memory  filter  to  the  rendezvous  problem,  assume 

A 

we  have  available  p  i  and  P  .  ,  and  It  Is  desired  to  obtain  the  limited 

m|m  m|m 

memory  estimate  over  the  set  of  (k  -  m)  observations.  The  necessary  quanti¬ 
ties  required  by  equations  (2.4.8)  and  (2.4.9)  are  given  below. 


Xk\k  m  PJt|*  =  +  Kk  rk\k-l 


*  P  |  +  T  K .  r.  |  , 

"I"  jStn  1 

where  the  predicted  residual  Is 


p  y  airih  yt .  +  p  cosh  yt .  - 


( ;.b.v 


and  the  gain  at  tj  Is 


K 


mm 


V-m>  Fm\m  *  *  ' 


m  +  1  _<  j  k 


(4.8.2) 


Similarly, 


mm 


R 


(k-m)  P 


mm 


+  R 


(4.5.3) 


and 


P.  |  -  P 

k  m  m  m 


(4.5.4) 


P.  \ . »  the  error  covariance  based  on  observations  from  t  to  t, ,  may  be 

k  |  (k-m)  mk 

found  directly  from  equations  (2.4.9)  or  (2.4.10)  or  may  be  computed  recur¬ 
sively  by  (2.4.12).  The  computation  using  equation  (2.4.9)  yields 


n  _  r p  ”  1  _  i> 

k  |  (k-m)  1  /c  | /c  *|m  J 


-t 


(k-m)  P  .  +  A’ 

_ juLe _ 

P  |  R 
mm 


mm 


]- 


\ 
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or 


Pk  I  (k-m)  ~  R/(k~m) 


(4.5.5) 


It  Is  noted  in  passing  that  for  k-m  »  1  ,  l.e.,  one  observation,  the  cor¬ 
responding  error  covariance  is  as  expected,  *  /?. 

Using  equation  (D.4.8)  the  limited  memory  estimate  Is  found. 

(k-m)  P  i  *  F  f  ~  k  /'  i  v ,  i  , 


W 


m 


i'  i  f  w  Ta  A-  /’  |  m  .  | .  ,  "i  pi  ) 

"»Lm. , A  .  +  V  ELm-J.li~.J _ SUE  ( 

/?  h  m  m  i  /  •  i  n  ,  ,,  /’  |  1 

m  1  ,i=rn+l  (,i-t'i)  /’  i  -f  /i  m  m  1 

mm  «* 


or 


,  /v  (k-m)P  |  +  F 

(k-m)  •  .  m|m 

~T~  PmU  +  t 


"jh zi  I 


,  (j-m)  P\  +  F  \ 

v  m|m  ' 


P k  |  (k-m) 


l  ,  +  rp  .  +  £ i  V'  _ ^Jliz1 - 

Pm  m  1  m  m  *  (k-m)  J  (,i-m)  Pi  +  F 

1  1  Q=m+1  *  m|m 


(4.5.6) 


Thus,  the  outputs  of  (4.5.5)  and  (4.5.6)  are  the  limited  memory  error  co- 
variance  and  estimate.  Note  that  when  k-m  =  /,  the  estimate  depends  only 
upon  the  most  recent  observation.  In  this  case  (4.5.6)  gives 

pk\(i)  =  pk-i\k-i  +  r*|*-i 

»  P0Y  sink  Y tk  +  P0  I'osh  yt-k  +  VR 


4.6  Numerical  Results 

The  foregoing  sections  have  considered  the  application  of  some  of  the 
error  compensation  methods  to  the  rendezvous  problem.  In  particular,  various 
closed  form  expressions  for  the  range- rate  estimate,  error,  and  steady-state 
gain  and  covariance  equations  were  derived.  Such  expressions  help  provide 
analytical  Insight  Into  the  filter  structures.  However,  the  performance  of 
the  algorithms  can  be  seen  best,  perhaps,  through  numerical  simulations. 


Thus,  In  this  section  numerical  results  are  obtained  for  each  of  the 
compensation  algorithms  previously  presented. 

To  carry  out  the  simulations,  nominal  parameter  values  defining  the 
dynamic  model  were  selected  as  shown  In  Table  4.1.  The  model  error  parameter, 
y,  is  defined  by  equation  (1.4.20)  as 


and  reflects  the  gravitational  parameter,  y,  the  target  vehicle  radial  distance, 
rfc,  and  the  llne-of-siqht  angle  (above  the  local  horizontal).  For  Earth  orbit, 
y  s  398603.2  km3/sec2.  Thus,  using  the  value  of  y  given  in  the  table,  values 
of  e  are  determined  for  various  values  of  | r*fc j .  For  example,  an  orbital 
altitude  of  160  km  (|i*fc|  =  6538.165  km)  corresponds  to  e  =  4.96°;  at  240  km, 
e  *  5.1°. 


Parameter 

Value 

Initial  range ,  p„ 

10  km 

• 

Initial  range  rate ,  p0 

-1  m/sec 

• 

Initial  range  rate  error ,  p0 

0  m 

A  pKiofu  variance,  P0 

10  (m/sec)2 

Observation  error  variance,  R 

.01  (m/sec)2 

Observation  interval,  ht 

10  sec 

Model  erx'or  parameter,  y 

.18  x  icT 3  (sec)'1 

Table  4.1  Nominal  Simulation  Parameters 


V 
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In  performing  the  simulations,  the  nominal  or  assumed  model,  as 
before.  Is  Pk  *  p^.  The  true  dynamics  are  computed  according  to  the 
linearized  model,  equation  (1.4.22).  The  true  range  rate  observations  are 
computed  by  adding  a  random  number,  distributed*  N(Ot  r),  to  the  true  range 
rate  from  (1.4.22).  The  observation  deviation  Is  then  found  by  (1.4.23)* 
and  the  filtering  equations  of  the  various  algorithms  are  applied  to  obtain 
the  gain,  estimate  and  updated  state  error  variance.  In  all  cases  Investi¬ 
gated,  the  true  initial  error  is  zero,  as  seen  from  Table  4.1.  Also,  to 
provide  a  common  basis  for  evaluation,  the  error  sequence  Is  identical  in  all 
cases . 


Standard  Minimum  Variance  Estimates.  To  provide  some  standards  of 
reference,  Figure  4.1  shows  the  behavior  of  the  ordinary  minimum  variance 
algorithm  In  the  presence  of  model  error  with  no  model  error  compensation 
techniques  employed.  Also  shown  Is  the  case  with  perfect  modelling.  The 
ordinate  axis  represents  the  estimate  error  in  meters,  with  the  abscissa 
measuring  time  In  seconds.  Three  curves  are  shown.  The  solid  line  represents 
the  actual  estimate  error,  p  ,  where  the  model  error  (1.4.25)  is  present.  The 

A 

dotted  line  gives  the  estimate  error  for  no  modelling  error.  The  dashed  line 
represents  the  computed  error  standard  deviation  (s.d.),  a  e  7p~  .  Note 

a  K 

that  this  curve  Is  the  same  for  both  cases,  regardless  of  the  presence  or 
absence  of  model  errors,  since  the  estimation  algorithms  are  identical. 

For  the  Incorrect  model,  the  rapid  rate  at  which  the  estimate  of  the 
range  rate  diverges  Is  rather  astonishing.  Thus  at  typical  Earth  orbit 


*  The  notation  N(0t  R)  means  that  a  random  number  is  sampled  from  a 
normal  distribution  of  zero  mean  and  variance,  R.  In  the  simulations, 
normally  distributed  random  numbers  were  calculated  using  the  Central  Limit 
Theorem  (46,  98-99).  Further,  the  same  sequence  of  random  numbers  was 
repeated  from  run  to  run. 


(098/iu )  a.oj.j.3  aq.vy  36uvy 


Figure  4.1  Range  Rate  Error  With  and  Without  Model  Error 
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altitudes,  relatively  small  line-of-sight  angles  --  on  the  order  of  5°  -- 
produce  a  dramatic  divergence  of  the  range  rate  estimate.  On  the  other  hand, 
for  the  exact  model  (1.4.22)  the  power  of  the  Kalman  filter  is  reflected  by 
the  dotted  curve:  after  1500  seconds  the  error  has  practically  disappeared. 
Thus  these  two  curves  can  be  seen  to  bound  the  performance  of  any  of  the 
compensation  algorithms  investigated  for  this  problem. 

Commonality  of  the  Algorithms.  Because  of  the  fact  we  are  dealing 
with  a  single  state  variable,  it  is  possible  to  relate  most  of  the  suboptimal 
filter  parameters  to  obtain  equivalent  steady  stage  filter  performance.  In 
other  words,  through  appropriate  expressions  relating  the  filter  parameters 
to  each  other,  it  is  possible  to  obtain  a  value  for  each  parameter  which  will 
produce  equivalent  steady  state  performance  for  each  algorithm.  (Exceptions, 
which  are  treated  separately,  are  the  ordinary  [Z>  =  constant]  gain  scaling 
algorithm  and  the  limited  memory  filter.) 

Consider  the  steady  state  relations  for  the  age-weighting,  modified 
gain  scaling,  and  additive  gain  term  algorithms.  These  are  given  by  (4.2.5), 
(4.2.13),  and  (4.3.17),  or  (4.4.1)  respectively  as 


Pm  *  B  R 

(Recall  that  this  last  equation  is  the  same  for  the  modified  gain  scaling  of 
equation  (2.2.14)  and  for  the  additive  gain  term.)  For  equivalent  steady 
state  performance  each  of  the  above  expressions  must  be  equal  to  any  other, 
thus  providing  the  necessary  relationships  relating  the  filter  parameters. 
Therefore, 


\ 
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For  the  case  where  the  state  noise  covariance  is  employed,  the  steady  state 
expression  given  by  (1.4.32) ,  may  be  equated  to  the  corresponding  expressions 
containing  the  suboptimal  filter  parameters.  Thus, 


p  *  L±JL±  Vf  R  (1.4.32) 

1  +  /I  +  4/f  +  2/ f 

where  /»  Q/R%  may  be  included  In  (4.6.1)  so  that 

C-  a  _8~J_  1  +  Jl  +  4/f  (4.6.2) 

2  "  a  8  1  +  / 1  +  4/f  +  2/J' 

Using  these  equations,  a  table  of  values  may  be  constructed.  Table  4.2  gives 
such  appropriate  values  which  yield  the  equivalent  filter  performance.  The 
table  was  constructed  by  selecting  various  values  of  3  and  then  solving  for 
the  corresponding  values  of  a,  s,  and  Q. 


3 

a 

8 

0. 

0. 

1.0 

0. 

0. 

.10 

.  1818 

i.in 

.011  lh' 

.  m 

.20 

.  3333 

1.250 

.0500 

i'  •  ’ 

•  .  •  • 

.30 

.  4615 

1.429 

.  13 865' 

.  3R 

.40 

.5714 

1.667 

..-VV'V 

.  OR 

.50 

.6667 

2. 000 

.  5000!; 

• «» *Y 

.60 

.  7500 

2.500 

.90005' 

.  6!x 

.70 

.  8235 

33  & 

l.  0 

.  7:i 

.80 

.8889 

5.000 

.S.  300 .V 

.8!' 

.90 

.9474 

10.00 

8.  100! i 

.  9R 

.99 

.9950 

100.0 

98.  UR 

.99  R 

1.00 

1.000 

00 

lV> 

5 

Table  4.2  Filter  Parameter  Values  for 
Equivalent  Steady  State  Performance 
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figures  4.2a  through  4.2e  Illustrate  the  numerical  injui valence  in 
terms  of  filter  performance.  The  curves  were  actually  generated  using  the 
additive  gain  term  algorithm  for  8  »  .1,  .3,  .6%  .$,  and  l.o ,  respectively. 
However,  steady  state  conditions,  and  hence  equivalent  performance,  are 
quickly  reached.  These  curves  therefore  Illustrate  essentially  the  same 
performance  for  each  of  the  algorithms.  As  in  Figure  4.1,  the  solid  line 
represents  the  actual  estimate  error,  p^.  The  dashed  lines  represent  plus 
and  minus  values  of  the  computed  standard  deviation,  o  . 

/v 

In  examining  the  figures  one  finds  that  the  maximum  and  minimum  error 
values  Increase  with  increasing  values  of  the  filter  parameters.  In  the 
extreme  (Figure  4.2e),  the  estimate  follows  the  observations,  and  the  errors 
are  greatest  In  absolute  value.  Based  on  the  sample  of  384  points,  71.61% 
of  the  estimates  are  within  the  lo  value*  of  .1  meter,  determined  from  the 
error  covariance  computed  by  the  filter.  In  Fiqure  4.2a  the  absolute  value 
of  the  maximum  and  minimum  errors  are  smallest.  Here,  64.06%  of  the  384 
points  are  within  the  lo  value  of  .03163.  However,  the  model  error  appears 
to  be  showing  itself  in  the  form  of  a  bias;  the  majority  of  the  error  values 
lie  above  the  expected  value  of  zero.  In  Fiqure  4.2b  the  error  does  not 
exhibit  such  a  bias,  thus  suggesting  parameter  values  somewhere  between  those 
for  Figures  4.2a  and  4.2b  (e.g.,  .  1  _<  e  <_  . 3)  --  at  least  for  the  time  inverval 
of  3840  seconds.  However,  regardless  of  the  value  selected,  as  long  as  8  <•  1, 
divergence  can  be  expected  to  occur  eventually.  This  follows  from  the 
unboundedness  of  the  error,  equation  (4.S.20). 

Because  the  steady  state  variance  associated  with  the  ordinary  gain 
scaling  ( b  *  constant'  Is  zero,  the  equivalence  developed  above  cannot  be 

*  A  short  discussion  of  an  interpretation  of  o  in  light  of  normal 
ergodir  processes  is  given  in  Appendix  D. 
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obtained.  Thus  this  algorithm  results  In  a  completely  different,  and  not 
very  effective,  filter  performance.  This  is  seen  In  Figure  4.3,  which  shows 
the  estimate  errors  for  the  gain  scaling  algorithm  using  values  of  b  -  1.3, 

1.6,  and  1.9,  respectively.  Clearly  the  performance  Is  unacceptable;  diver¬ 
gence  occurs  soon  In  all  cases.  The  values  b  -  1.3  and  b  =  1.6  are  seen  to 
have  little  effect  on  the  estimation  performance.  Only  when  b  becomes  close 
to  2  does  the  algorithm  have  an  effect  in  delaying  the  divergence.  Even  then, 
the  variance  eventually  becomes  zero,  although  it  simply  takes  longer.  Recall 
from  equation  (4.2.12)  that  b  -  2  prevents  any  change  in  the  variance,  and 
hence  P  «  P„.  For  this  problem  the  ordinary  gain  scaling  is  obviously  not 
acceptable. 

Adaptive  Estimation  of  State  Noise  Covariance  and  Suboptimal  Parameters. 
Numerical  simulations  were  performed  for  each  of  the  adaptive  forms  for  esti¬ 
mating  the  state  noise  covariance  and  suboptimal  parameters,  as  presented  in 
sections  3.1  and  3.2.  In  these  simulations,  the  values  of  y  given  by  (2.1.13) 
and  (3.1.14)  were  used  (the  sample  mean  of  the  square  of  the  previous  N 
predicted  residuals.  Initially,  (3.1.14)  was  used  to  "build  up"  the  set  of 
N  residuals.  Subsequently  (3.1.13)  was  used  to  compute  y^.  Use  of  equation 
(3.1.12)  was  not  investigated  due  to  the  inherent  disadvantage  of  either  having 
the  estimate  lag  the  observations,  or  having  to  reprocess  or  smooth  the 
estimates. 

In  computing  the  various  parameters,  the  following  equations  were 
employed: 

Qk  -  max  [0.3  yN  -  l’K  -  A’]  (4.6.3a) 

8.  =  max  [1.,  (yt,  -  R)/l\  J  (4.$.  3b) 

K  N  K 

bk  =  max  [l.t  (yN  -  R)(Pk  +  R)/Fk ] 

N 

\ 


(  4.6 .  .v.) 
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Figure  4.3  Range  Rate  Error  for 
Standard  Gain  Scaling 


<xk  =  max  [hN  -  R)/Sn,  pk/(pk  + 


1?8 


( 4.6. 3d) 

$k  -  1.  -  (Pk  +  R)/max  [(Pk  +  R)t  y^l  (4.6.  Se) 

These  equations  yield  appropriate  values  for  the  parameters  according  to  the 
values  of  yw*  They  also  automatically  Incorporate  the  attendant  limits  on  the 
parameter  values,  which  Is  the  reason  for  the  presence  of  the  max  function. 

In  carrying  out  the  simulation,  some  particularly  Interesting,  if  not 
surprising,  results  were  obtained.  Three  different  values  of  N  were  used  In 
computing  yw»  namely  N  *  l,  10.  and  20.  In  each  case,  the  algorithms  exhibited 
very  similar  behavior,  with  the  exception  of  the  ordinary  gain  scaling  algo¬ 
rithm.  In  fact.  It  was  found  that  the  performances  of  Jazwlnskl's  adaptive 
^-estimator  yielded  results  Identical  with  the  adaptive  age-weighting  algo¬ 
rithm.  Further,  Identical  results  were  also  obtained  with  the  adaptive  forms 
of  the  two  modified  gain  scaling  algorithms  and  the  additive  gain  term  approach, 
of  Schmidt.  However,  some  small  differences  were  noted  between  these  two  sets 
of  performances.  The  simulation  results  are  shown  for  each  N  In  Figures  4.4, 
4.5,  and  4.6,  respectively. 

Figure  4.4a  shows  the  estimate  error  and  tla  curves  for  Jazwinski's 
^-estimator  and  adaptive  age-weighting  with  N  -  1.  Here,  64.84%  of  the  384 
estimates  are  within  the  computed  la  of  the  true  value.  Very  similar  estimate 
error  behavior  is  shown  In  Figure  4.4b  for  the  modified  gain  scaling  algo¬ 
rithms  and  additive  gain  term  technique.  Although  the  performances  are  nearly 
the  same  for  the  two  sets  of  plots,  In  this  latter  case  only  59.35%  of  the 
384  estimates  are  within  la  of  the  true  value.  On  this  basis  the  adaptive 
forms  of  the  state  noise  covariance  and  age-weighting  algorithms  appear  to 
offer  greater  confidence  In  their  associated  estimates.  This  suggestion 
seems  to  be  further  substantiated  by  Figures  4.5a  and  4.5b.  Here  the  same 
type  of  Information  is  plotted  as  In  Figures  4.4,  but  now  N  *  IP.  The 


Figure  4.4  Adaptive  Algorithms  for  1  Residual 


Fiqure  4.5  Adaptive  Algorithms  for  10  Residuals 


Figure  4.6  Adaptive  Algorithms  for  20  Residual 


percentages  of  the  384  points  lying  within  ±la  of  the  mean  are  55.99%  and 
50 . 78%  respectively  for  Figures  4.5a  and  4.5b.  The  same  trend  is  present 
for  the  case  where  N  =  ?.0,  Figures  4.6a  and  b.  Here  the  respective  percentages 
are  59.9%  and  51.30%.  Thus  the  implication  is  that  the  former  algorithms 
offer  greater  confidence,  as  measured  by  the  computed  covariance. 

Another  notable  characteristic  is  present  in  each  of  Figures  4.5a  and 
4.5b.  Close  examination  suggests  the  appearance  of  the  model  error  bias, 
since  the  majority  of  the  error  values  appear  above  the  expected  zero  value 
on  both  sets  of  piots.  This  implies  that  when  more  residuals  are  used  In 
forming  y  ,  the  bias  errors  have  more  opportunity  to  affect  the  estimate. 
Increasing  the  residual  sample  size  to  N  =  20  yields  the  plots  shown  In 
Figures  4.6  which  confirm  this  suspicion.  In  retrospect  this  should  not  be 
surprising;  as  N  ■*  °°  the  adaptive  algorithms  cease  to  operate,  and  the  bias 
errors  dominate  producing  divergence  of  the  estimate. 

For  each  of  the  values  of  N  =  1,  10,  20,  the  adaptive  form  of  the 
ordinary  gain  scaling  algorithm  yields  the  results  plotted  in  Figures  4.4c, 
4.5c,  and  4.6c.  Clearly  these  results  are  unacceptable.  Initially  the  error 
variance  is  large;  hence  the  adaptive  algorithm  obtains  no  information  from 
the  residuals  since  yN  is  generally  small  and  hence  (4.6.3c)  produces  bk  -  1. 

As  the  process  continues,  yN  tends  to  increase,  but  the  sensitivity  of  the 
algorithm  is  simply  not  sufficient  to  control  the  divergence  of  the  estimate. 
There  appears  to  be  a  simple  explanation  for  this.  For  the  adaptive  form  of 
each  algorithm  consider  the  sensitivity  of  the  gain  to  changes  in  the  residual 
sample,  y  .  This  is  determined  by  using  the  appropriate  member  of  equations 
(4.6.3)  in  forming  the  corresponding  gain,  and  then  taking  the  partial  deriva¬ 
tive  with  respect  to  y  .  In  all  but  the  case  of  the  ordinary  gain  scaling 
algorithm,  the  partial  derivative  is 
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<)  y.  a  ,  . 

V  V 

For  the  ordinary  gain  scaling, 

dK  l 

3y.  "  P  > 

N 

From  these  equations  the  reason  for  the  very  different  behavior  should  be 
apparent.  First,  note  that  when  the  residuals  are  very  small  the  sensitivity 
given  by  (4.6.4)  Is  very  high.  This  is  desirable  since  the  gain  values  then 
respond  quickly  to  changes  In  y„.  However,  for  gain  scalinq,  a  similar 
sensitivity  requires  that  P  be  very  small.  This  is  exactly  the  type  of 
behavior  displayed  In  Figure  4.6c.  Only  when  the  error  variance  has  decreased 
significantly  does  the  adaptive  feature  begin  to  operate.  Thus  based  upon 
the  Indicated  performance,  the  ordinary  gain  scaling  algorithm  will  be  dis¬ 
missed  from  further  consideration  as  a  viable  filtering  algorithm. 

No  discussion  has  been  given  yet  concerning  the  computed  values  of 
the  state  noise  variance  or  suboptimal  filter  parameters.  An  obvious  approach 
is  simply  to  "shotgun"  the  reader  with  a  set  of  some  21  graphs  of  estimation 
curves  and  then  pick  through  the  data  a  "pellet"  at  a  time.  A  preferred 
approach,  and  the  one  followed  here,  is  to  offer  a  representative  sample  and 
then  to  note  certain  trends  determined  from  examination  of  this  and  previous 
data.  In  this  manner  we  hope  to  avoid  "blowing  the  reader's  mind"  *  and 
still  provide  a  further  measure  of  Insight  into  the  behavior  and  relationships 
of  the  algorithms. 

We  proceed  by  offering,  as  the  representative  sample,  plots  of  the 
state  noise  standard  deviation,  k  =  Sq,  given  in  Figures  4.7a  through  c. 

Figure  4.7a  gives  the  values  of  <  determined  from  Jazwinski's  ^-estimator 


*  This  obvious  pun  could  not  be  resisted. 
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•  7b  10  Residuals 


4.7a  1  Residual 


Figure  4.7  State  Noise  Standard  Deviation,  < 
for  Various  Residual  Sample  Sizes 


based  upon  single  residuals  ( N  =  /  for  yN).  During  the  initial  phase  of 
operation,  while  the  state  error  covariance  is  reasonably  large,  few  non¬ 
zero  values  of  Q  are  computed.  After  800  seconds,  the  error  covariance  has 
decreased,  and  there  is  a  high  frequency  of  non-zero  estimates.  In  Figure 
4.7b,  the  values  of  k  are  shown  based  upon  a  10-res  1  dual  sample  size  (.7  * 
for  yN  ).  Here  both  the  frequency  and  magnitude  of  computed  values  of  <  are 
less  than  for  the  single  residual  case.  Figure  4.7c  continues  the  trend. 

In  all  cases  the  estimated  values  of  Q  clearly  result  in  the  desired  increases 
of  the  la  curves  of  Figures  4.4a,  4.5a,  and  4.6a.  Note,  however,  that  as 
the  residual  sample  size,  N,  increases,  the  state  estimate  bias  error  becomes 
more  prominent.  As  pointed  out  earlier,  as  N  -*  ™,  we  eventually  have  no 
£-esti mates  being  produced,  and  thus  the  state  estimate  diverges. 

For  the  suboptimal  parameters,  excluding  ordinary  gain  scaling  as 
stated  earlier,  similar  results  have  been  consistently  obtained.  Of  course, 
the  limiting  values  are  different;  hov^ver,  the  parameter  estimates  show  the 
same  random  "spike"  behavior  as  those  for  k.  Similarly,  the  frequency  and 
magnitude  of  the  values  decrease  with  Increasing  N%  resulting  in  eventual 
dominance  of  the  bias  error  in  the  state  estimate.  Further  evidence  of 
similar  performance  for  the  suboptimal  parameter  adaptive  estimation  is 
given  by  the  similarities  of  the  error  curves  in  Fiqures  4. 4-4. 6.  If  signifi¬ 
cant  differences  existed  in  the  adaptive  computation,  these  would  also  be 
manifested  as  significant  differences  in  the  various  error  and  lo  curves.  On 
this  basis  one  may  conclude  --  and  the  estimated  parameter  data  bear  this 
out  —  that  no  fundamental  differences  exist  in  the  performance  of 
Jazwinski's  adaptive  state  noise  covariance  estimator  and  the  remaining 
adaptive  suboptimal  parameter  algorithms  for  this  scalar  example.  (This 
last  qualification  is  important  and  will  be  discussed  in  section  4.7.) 
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Limited  Memory  Filter.  Figures  4.8a  throuqh  4.8c  show  the  estimate 
error  for  Jazwinski's  limited  memory  filter.  The  fl;u-es  correspond  to  the 
values  of  the  "memory  length,"  i.e.,  the  number  of  observations  processed 
between  applications  of  the  limited  memory  updates.  Thus  memory  lengths  of 
1,  10,  and  20  observations  are  reflected  in  the  performance  given  by  Figures 
4.8a,  b,  and  c,  respectively. 

In  Figure  4.8a  the  filter  follows  the  observations.  Comparison  with 
Figure  4.2e  clearly  indicates  identical  behavior,  as  expected,  with  the 
corresponding  steady  state  covariance,  l'm  *  h\ 

For  N  =  10  the  estimate  error  is  plotted  in  Figure  4.8b.  In  this  case 
the  bias  error  appears  as  an  excess  of  points  above  the  time  axis.  However, 
toward  the  end  of  the  interval,  the  central  tendency  is  back  toward  zero  mean. 
To  Investigate  this  further,  the  case  of  a  20-observation  memory  was  run.  The 
results  appear  in  Figure  4.8c.  As  expected,  the  bias  effects  are  even  more 
pronounced.  Indeed,  if  the  memory  size  is  made  as  long  as  the  complete  ob¬ 
servation  period,  the  performance  Is  simply  that  obtained  for  no  error 
compensation;  I.e.,  the  estimate  error  grows  without  bound. 

In  both  4.8b  and  c  the  typical  behavior  of  the  covariance  may  be 
noted.  This  is  expressed  as  the  saw-tooth  form  of  the  tic  curves.  Each 
peak  corresponds  to  an  increase  in  the  variance  produced  by  the  limited 
memory  updated.  After  each  update,  the  variance  begins  decreasing  in  accord¬ 
ance  with  the  operation  of  the  ordinary  Kalman  filter  equations.  The  width 
of  each  of  the  saw-tooth  pulses  Is  therefore  directly  proportional  to  the 
memory  length. 

The  accuracies  obtained  with  the  limited  memory  filter  are  generally 
on  the  order  of  those  obtained  by  the  adaptive  forms  of  the  suboptimal 
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algorithms.  However,  based  on  the  numerical  results,  the  memory  length  must 
be  smaller  than  the  residual  sample  size  used  in  the  adaptive  algorithms. 

For  this  scalar  problem  the  limited  memory  filter  requires  no  more 
computational  effort  than  the  other  adaptive  forms.  In  vector  problems 
involving  many  state  elements,  the  computational  load  becomes  quite  severe 
due  to  the  required  matrix  inverses.  For  this  reason  the  limited  memory 
filter  loses  its  attractiveness  for  use  In  more  complex  problems. 

Sequential  Estimation  of  the  State  Noise  Covariance.  We  turn  now  to 
the  performance  of  the  sequential  estimator  for  Q.  Initial  runs  of  3840 
seconds  were  made  using  a  jpfiiofU  values  for  the  error  variance,  S  =  eQz], 
the  residual  error  variance,  T  =  E{62},  and  the  initial  value  of  Q  as  follows: 

S0  -  1.0  (m/sea)'* 

To  -  V  -  (y,  -  bx)2 

Qq  =  0.  (m/sea)2 

The  performance  was  slightly  better  than  for  Jazwinski's  adaptive  ^-estimator, 
although  the  bias  error  was  apparent.  The  estimated  values  of  c  took  some 
time  to  "settle."  This  appeared  to  occur  near  the  end  of  the  interval.  To 
investigate  this  further,  the  interval  was  doubled  to  7680  seconds;  the 
results  are  shown  in  Figures  4.9a  and  b,  which  show  the  estimate  errors  for 
p  and  estimates  of  Q  respectively.  Also  shown  are  the  corresponding  p-error 
la  curves.  The  la  value  for  the  p  error  appears  to  have  a  mean  of  approxi¬ 
mately  .04  m/sec,  with  70.05%  of  the  error  being  within  la.  The  corresponding 
value  of  <  =  7q  is  around  .015  m/sec.  At  the  final  time,  the  confidence  in 
the  estimate  of  Q  is  reflected  by  a  la  value  of  7s  =  .027  (m/sec)2.  The 
data  indicated  that  all  the  (.'-estimates  lie  under  the  Js  curve,  suggesting  a 
decrease  in  the  a  puicni  S.  Though  not  immediately  obvious,  close  examination 
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of  the  /o- curve  and  the  >-curve  reveals  a  slight  trend  to  increasing  values. 
This  is  not  surprising  since  we  know  the  actual  error  is  increasing,  (the 
bias  error  is  obvious  here  in  Figure  4.9a),  and  hence  the  ^filter  is  simply 
trying  to  adapt  to  this.  Further,  one  can  expect  that  divergence  of  o  will 
eventually  occur  as  the  ^'-filter  saturates,  and  no  new  information  is  added. 
Recall  that  this  is  due  to  the  assumption  that,  the  error  is  a  stationary 
random  process,  i.e.,  constant  Q.  An  obvious  remedy  is  to  include  a  process 
(state)  noise  variance  term  in  the  ^-estimator,  as  suggested  by  equation 
Another  alternative  is  to  improve  the  assumed  model  so  that  the 
assumption  of  constant  Q  is  more  accurate.  This  approach  is  considered  in 
the  discussion  on  estimating  model  errors. 

A  notable  aspect  of  the  sequential  ^-filter  performance  is  the  fact 
that  the  h  curve  has  been  found  not  to  decrease  as  rapidly  as  for  the  ordinary 
Kalman  filter.  Consideration  of  the  constraint  on  the  estimate,  i.e.,  s 
reminds  one  that  the  filter  is  minimum  variance  only  in  the  a  pcstciicii  sense. 
Thus  the  variance  cannot  decrease  any  more  rapidly  due  to  this  constraint  on 
Q.  Further,  large  values  of  result  in  slow  decrease  in  This  behavior 
happens  to  hold  an  advantage.  Since  the  attendant  filter  does  not  decrease 
as  rapidly,  the  filter  operates  for  a  longer  period  of  time  before  saturating. 
On  the  other  hand,  the  estimates  take  longer  to  "settle  down."  Some  idea  of 
the  effect  of  different  a  p nLoni  values  of  the  error  variance,  may  be 
gained  from  Figures  4.10  and  4.11.  Figures  4.10a  and  b  show  the  p  error 
performance  and  estimates  of  Q  for  s0  =  S(t0)  =  .OK'v/sec)" .  The  performance 
of  the  range  rate  and  q  estimates  appears  to  be  about  the  same  as  those  in 
Figures  4.9a  and  b,  although  now  64.2%  of  the  error  is  within  la  .  Similarly, 
at  the  final  time  the  computed  standard  deviation  in  the  estimate  of  is 
7s  =  .  026  (m/ sea) 2  . 
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4.11a  Range  Rate  Error,  p 
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Figure  4.11  Range  Rate  Error,  p,  and  State  Noise  Standard 
Deviation,  k,  for  SH  =  10~k(m/8ea)k 
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Figures  4.11a  and  b  show  the  p  error  and  Q  estimates  for  = 

.  Here  the  range  rate  estimate  is  not  very  good.  A  large  bias 
appears  early,  although  this  decreases  as  larger  ^-estimates  are  obtained 
(Figure  4.11b).  Still,  only  30.08X  of  the  estimates  lie  under  the  la  curve. 

At  the  end  of  the  simulation  v57  =  ..033  x  u~?(m  <?.•«*> ’ . 

A  number  of  runs  were  made  to  investigate  the  effects  of  different 
a  puioKA  values  of  v.  The  results  indicated  that  the  <i  j value  has  little 
effect  upon  both  the  estimates  of  r  and  Using  the  algorithm  of  equation 

30 ,  the  estimated  values  of  r  decreased  monotonically  from  21.94  (m/sec)1' 
to  .2571  (m/sec)4.  Presumably  if  more  points  were  taken,  this  would  decrease 
even  further.  The  modification,  given  by  equation  (3.6.36)  of  using  a  limited 
batch  of  A-residuals  was  not  investigated  for  this  problem,  but  is  treated  in 
the  next  chapter. 


Estimation  of  Model  Errors.  Suppose  the  assumption  of  a  constant 
closure  rate  is  modified  so  as  to  be  a  linear  function  of  time.  We  thus  assume 
the  model  error  may  be  approximated  by  equation  (3.3.6).  If  we  further  assume 
oo  =  0  and  A  e  A,,  then  the  range  rate  model  becomes 


pk  *  pk-i  +  ck-i  At 


°k  =  °k-i  +  w  k-i 


■1.6.0 


where  a  is  unknown,  and  is  to  be  estimated  along  with  p  .  The  term  v'  is  a 

/C  A 

process  noise  term  added  to  a  and  represents  fundamental  uncertainties  in 
knowledge  of  a.  Its  statistics  are  E{w'}  =  0  and  E{w'2}  =  q.  There  are  now 
two  state  variables,  and  the  problem  may  be  cast  in  the  canonical  form  as 


pi  -  P  4tl  pi  *  p] 

k  L0  1 J  L°J 


I  Xk-1  +  'k-1 


(4.6.7} 
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In  this  recast  form,  the  approach  Is  to  use  the  standard  minimum  variance 
filter  with  the  corresponding  ^-matrix  being 

H  -  [1  0]  (4.6.8) 

In  light  of  this  extended  form.  It  Is  Important  to  raise  the  question  of 
observability.  Consider  the  information  matrix  given  by  equation  (i..-.ri) 
with  j  *  7. 


.  2  =  L 

i=l 


$.T.  h.t  h. 

i,k  i  i 


-1 


H, 


lk 


1 

t  • 


t*  .  *'  ' 


Substituting  for  4>,  //,  and  R, 

■  t  u  i]  [;] 


(IOC)  [1  0 ) 


=  look 


f  r  *  rW  1 

h  Lrvv  fw2J 


Forming  the  quadratic  form  using  an  arbitrary  2-vector,  s,  yields 


(4.6.10) 


/t 


3  -  MW!  £,  <*■ +  Vei  -  v>: 


(4.6.11) 


which  is  clearly  positive  for  all  non-zero  values  of  z  and  k  >  1.  Thus  our 
assumed  dynamics  represent  a  completely  observable  system.  Note  that  this 
would  not  be  true  were  we  Interested  only  in  o ,  since  the  observations  do 
not  relate  directly  to  o.  Rather,  information  about  a  is  obtained  only 
through  its  correlation  with  p.  Thus  in  estimating  a  it  is  the  cross- 
correlation,  a^c  (normalized  by  the  variances  o^2  and  r)  which  comprises  the 
gain,  K  ,  for  c, 

C 

K  =  a.  /(a.2  +  R)  (4.6. 1Z) 

c  pc  p 

We  note  in  passing  that  the  system  is  also  completely  controllable 


since 
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\,0  "  « 


wlrvtj  2  J 


which  follows  from  (1.3.25). 

In  carrying  out  the  simulations  for  this  formulation,  the  a  pxicii 
values  of  o  =  0  and  p{52}  =  l.Ofm/aec 2J2  are  used.  Initial  results  with 
Q  »  e{imt}  =  0  are  shown  in  Figure  4.12a.  The  performance  is  such  that  as 
the  error  covariance  decreases,  so  does  the  gain,  and  hence  the  constant  i? 
tends  to  a  particular  value.  However,  the  true  error  actually  changes  in  a 
non- linear  manner  so  the  bias  error  begins  to  dominate  after  about  800  seconds 
(Only  26.82%  of  the  384  points  are  within  ;/o  of  the  expected  value.)  This, 
of  course,  is  due  to  the  fact  that  Q  -  0.  Now  examining  the  /o- curve  suggests 
that  a  reasonable  in  steady-state  value  of  .02  m/sec  for  p  might  be  acceptable 
The  value  of  Q  which  yields  this  Jo-value  may  be  found  using  the  covariance 
prediction  and  update  equations.  For  the  prediction, 

P  =  *  P  4>r  +  Q 


r p i i  piai  r j  Afcirpn  p^itj 

lP21  P22  J  10  1  J  L  P  2  1  p?2JLAt  /  J  L  0 


Carrying  out  the  multiplication  yields 

Pu  =  Pll  +  3p  12  At  +  p 22  At2 


P 12  *  Pl2  +  p 2  2  At 


(1.6. 


( ■!.  6 .  {-Pi.! 


P 22  *  P 22  +  Q 


where  the  symmetry  of  P  is  recognized  as  pi2  *  P21. 


(-1.6.  JvV 


For  the  update  equation, 


( n/sec >  p  (n/sea) 
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P  -  (1  -  K  H)  P 


(p  pi  0  )  pu  P 1 2 *1 

IU  L&2  0  ml  _p  2 1  P22J 


or 


[P 11  p  1 2 *1  V(l-k\)  pu  (l-k\)  p  12  | 

P21  P22J  L"^2  Pu  *  P21  "^2  P12  +  P22J 


Pu  P12I  I  (l-kiJ  Pu  (l-k\) 

,P2i  P22J  L"^2  P11  *  P21  -k.2  p  12  *  P22J  (4.6.25) 

Employing  the  definition  of  K,  and  again  recognizing  the  symmetry  of  F, 


P 12 


■(i-=^y 

'  Pu  +  R ' 

\  p  11  +  R  / 

2 


A  Pi l 


Pll  -  — 


Pu  +  R 


R  p  12 


p  1 2  =  — 


P11  *  R 


Pu 


P22  *  -  ~ 


T  p22 


(4.6.16a) 


(4.6.16b) 


(4.6.16a) 


Pu  +  P 


(4.6.17) 


Solving  (4.6.16a)  for  pu  gives 
Pu  =  pu  -  p\\) 

For  steady  state  conditions,  the  pi.  in  (4.6.14)  equal  the  pi^  in  (4.6.16). 
Substituting  (4.6.14a)  Into  (4.6.16a)  and  solving  for  p\2  yields 


P12  =  dj(  pu  v-  R) 

Substituting  this  into  (4.6.16b)  gives  the  equation  forp12  as 

pi.'  R  ' 1 1  ^  R)  /(p\\  +  R) 


(4. 6. IS) 


1 4.6. 16) 


Next  equate  (1.0.  Mb)  and  (4.6.16).  and  substitute  ( 4.6.10 )  for  p.>  to  obtain 


P22  “  At  +  R)  (1'~ -  ; 


I  r  •  (1  »  •  * 


pu  +  R 


149 


Finally,  use  (4.6.17),  (4.6.19),  and  (4.6.20)  to  solve  (4.6.14a)  for  a.  The 
result  is 

Pi  i4 

q  « -  (4.6.21) 

u}(r  -  pu)(2R  -  pn;2 

Using  the  suggested  steady  state  value  of  a  *  .02  m/sec,  then  rn  -  a2  - 
4  >  10" ''(m/sec) 2  and,  approximately, 

q  .  /  *  10  '  (m/y<‘<'? ) 7 

The  values  of  A/  and  /.  are  taken  from  Table  4.1. 

Using  this  value  of  </  along  with  the  other  initial  and  a  /."» <>•(<  values 
( a  =  0  and  i:{,-?7)  =  nr /see1*)  produces  the  results  shown  in  Figure  4.12b. 
Here  definite  improvement  is  achieved  as  expected.  74.74%  of  the  384  points 
are  within  the  steady  state  la  value  of  .0209  m/sec  as  opposed  to  the  26.82’.' 
associated  with  Figure  4.12a.  Notable  characteristics  are  the  apparent 
oscillations  of  the  error,  and  the  fact  that  the  majority  of  points  lie  above 
the  time  axis.  Oscillatory  error  behavior  is  known  to  be  characteristic  of 
optimal  approximations,  e.g.,  (48,  75),  and  here  optimality  is  clearly  an 
objective.  The  latter  characteristic  implies,  as  in  earlier  runs,  that  the 
bias  error  is  becoming  apparent.  An  obvious  suggestion  is  to  increase  q. 

The  results  for  <y  =  .  1  x  io~6(m/sec2)2  are  shown  in  Figure  4.12c.  This 
arbitrary  increase  in  q  produces  a  steady  state  la  value  of  .053  m/sec,  and 
bounds  83.33%  of  the  error  points.  In  light  of  these  results,  it  appears  that 
the  optimum  q  is  probably  between  .1  *  10~9 (n/sec2)2  and  .1  *  ::~6(r  era2':. 

To  pursue  the  question  of  optimum  <3,  and  also  to  Investigate  the 
sequential  Q  estimation  algorithm  further,  the  idea  of  estimating  arises. 
However,  in  applying  the  sequential  algorithm  strictly  to  the  formulation 
given  by  (4.6.7)  the  observability  problem  is  brought  out  again,  this  time 
with  regard  to  estimating  Q.  In  writing  the  information  matrix,  r^, 
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corresponding  to  the  estimation  of  Q  one  finds  that  7°  «  o.  Because  of  the 
way  the  problem  is  formulated,  the  equation  for  r2  does  not  contain  .  expli¬ 
citly.  Thus  in  forming  7°,  the  measurement  mapping  term  is  zero  for  all  time, 
and  hence  so  Is  T°.  In  other  words,  the  term  corresponding  to  '  in  equation 
(1.3. PI)  is  zero.  Hence,  the  Q  corresponding  to  is  not  observable. 

Consider  the  formulation  of  the  original  problem  as  a  differential 
equation.  In  vector  form 
x  *  A  x  +  li  u 


or 


cm: 


(■Le.c'y 


where  the  identifications  are  obvious.  The  term,  u,  is  a  process  noise  term 
with  the  statistics  f{u}  =  o  and  E{u2}  =  q  &(t-t).  Now  the  solution  to  this 
differential  form  is  clearly  not  (4.6.7),  but  rather 

t. 


k  L 


i 


k"J  Jtk-i  L 


(t.-x) 

k 

7 


U  d  T 


(4. 8.23) 


Taking  the  approach  of  equation  (J.P..30),  where  n  is  assumed  constant  over 
[tk*  lk-i]'  yields 


X.  -  <t>.  .  ,  X.  + 

k  k,k-l  k 


k,k-l  k-1 


*  .0 . 4.  V 


J  ) 


where 


At 

At2' 

p 

’  0 

Vk,k-1 

.  0 

a 

At  . 

•  * 

U.  7 

L  k-1  J 

It  is  Important  to  note  that  one  form,  (4.6.7)  or  (4. P.24)  ,  is  not  necessarily 
more  correct  than  the  other.  In  both  cases  uncertainty  has  been  introduced 
(Uibitkahi (tj\  hence  the  measure  of  correctness  rests  in  the  c/iruv  of  the 
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model,  and  ultimately  In  the  performance  of  the  estimation  process.  For  our 
purposes  here  (4.6.24)  is  indeed  necessary  since  this  appears  to  be  the  only 
way  in  which  we  can  "get  a  handle"  on  Q. 

To  bring  this  approach  to  fruition,  use  (4.6.24)  to  form  the  error 
covariance  prediction  equation  as 


7  =  <f  p  4>t  +  r  q  v 

where 
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Carrying  out  the  first  Integration, 


The  difference  between  (4.6.29)  and  (4.6.26)  is  approximately  the  factor,  At. 
Forming  the  measurement  equation  we  have 


~  <J k  +  Ok 


and  similarly  the  information 


k.l 


k  T  At" 

p 


>  0 


(;.p.,\'' 


(4.6.f!> 


insures  complete  observability.  Owing  simply  to  the  more  rigorous  approach. 
(4. 6.  ?.o)  will  be  used  for  the  estimation  of 

Using  this  alternate  formulation,  Figures  4.13  through  4.15  show  the 
estimation  performance  for  the  three  a  pxioni  values,  -  t0~*>  and 

10~ 1 2 ( rn/ccc2 ) 4 ,  respectively.  The  results  indicate  increasing  estimation 
accuracy  of  the  range  rate  with  decreasing  a  piioii  values  of  In  particu¬ 
lar,  for  r;#  -  io~ B(m/r,ec2 )" ,  74.22%  of  the  range  rete  error  values  are  less 
than  the  lo  curve  (F1gures4.13a) .  From  Figure  4.13b  the  final  value  of  q  is 
approximately  4.3  x  io~t(m/eeo2)2 .  In  Figure  4.14a,  s0  =  :c~ 1 0 1 r  tv.*2 ■' 4 , 
78.78%  of  the  range  rate  error  points  are  less  than  Jo,  and  the  maximum  value 
of  q  (Figure  4.14b)  is  approximately  2.  x  lo'*(m/eea2)2 .  For  t-'0  =  ic~ 1:, 

80.1%  of  the  range  rate  errors  a’-*  within  la  of  the  expected  zero  mean  .(Figure 
4.15a).  The  maximum  cy-value  is  approximately  1.7  x  io~10 (r/eec2)2  (Figure 
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Figure  4.13  Range  Rate  Error,  p,  and  State  Noise  Standard 
Deviation,  <,  for  Linear  Error  Approximation 
With  5n  =  10  8  (m/sec3  )** 
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4.15a).  While  the  actual  magnitudes  of  the  ^-estimates  become  smaller  as 
the  a  pxioni  s  is  decreased,  within  each  case  the  sequence  of  Q  values 
increases.  This,  of  course,  results  in  increasing  the  error  variance  of  p. 

As  before,  this  may  be  attributed  to  the  error  growth,  for  while  the  error 
has  been  reduced,  it  has  not  been  eliminated. 

Note  that  for  sc  -  io~B (m/aec2 )u ,  comparatively  large  values  of  i,' 
are  obtained.  This  is  due  to  the  fact  that  initially  larqe  values  of  the 
gain  are  computed  thus  tending  to  make  the  values  of  ^  "follow"  the  observa¬ 
tions.  The  result  is  that  the  erratic  behavi  *r  of  e  from  one  observation  to 
the  next  appears  to  be  quite  random.  The  sequential  ^-estimator  sees  this 
and  produces  correspondingly  large  values  of  Q.  As  smaller  values  of  .'0 
are  employed,  this  effect  becomes  less,  thus  allowing  the  values  of  .*  to 
become  more  stable.  Figures  4.14a  and  4.15a  show  this  to  some  extent.  The 
estimates  of  Q  do  not  increase  quite  as  rapidly;  their  magnitudes  are  less, 
and  the  values  tend  to  reach  their  maximums  sooner. 

From  these  results  it  is  apparent  that  various  ft  priori  values  of  ^ 
have  definite  effects  upon  the  values  of  the  ^-estimates.  However,  through 
reasonably  careful  selection  of  SQt  practical  filter  operation  can  be  realized. 

4.7  Summary  and  Conclusions 

In  this  chapter  the  salient  aspects  of  the  estimation  techniques 
found  in  Chapters  2  and  3  have  been  investigated,  and  the  algorithms  applied 
to  the  rendezvous  problem  formulated  in  Chapter  1.  Explicit  closed  form 
expressions  have  been  obtained,  in  whole  or  in  part,  for  the  age-weighting, 
ordinary  and  modified  gain  scaling,  additive  gain  term,  and  limited  memory 
filter  algorithms.  For  these  as  well  as  all  the  other  algorithms,  numerical 
results  have  been  obtained. 
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Certain  equivalences  were  shown  to  exist  in  both  the  adaptive  and 
non-adaptive  forms  of  the  age-weighting,  modified  gain  scaling,  additive  gain 
term,  and  state  noise  covariance  algorithms.  While  these,  as  well  as  the 
limited  memory  algorithm,  were  found  to  be  acceptable  estimators,  the  ordinary 
gain  scaling  technique  ( b  =  constant)  was  shown  to  be  unacceptable,  in  both 
its  adaptive  and  non-adaptive  form. 

The  adaptive  techniques,  based  on  Jazwinski's  maximum  likelihood  method 
as  well  as  the  sequential  (^-estimator,  are  seen  to  be  a  step  closer  to  accurate 
estimation.  The  techniques  based  on  Jazwinski's  approach  have  the  particular 
advantages  of  being  both  simple  and  completely  adaptive.  The  primary  dis 
advantage  is  the  fact  that  usually  smoothing  must  be  introduced  in  the  form 
of  a  residual  sample,  thus  destroying  the  optimality  of  the  method. 

The  Kalman  filter  for  sequentially  estimating  the  state  noise  co- 
variance  appears  to  give  slightly  better  performance  than  Jazwinski's  adaptive 
estimator.  Within  the  stated  assumptions,  the  method  is  a  constrained  minimum 
variance  estimator.  However,  the  technique  is  more  complex  to  implement. 
Further,  the  assumption  of  b'{qQ]  =  0  destroys  the  optimality.  If,  in  fact, 

Q  does  not  represent  a  stationary  process,  then  the  assumption  of  constant  H' 
introduces  modeling  error,  thus  requiring  more  accurate  modeling  or  some 
technique  to  prevent  saturation  of  the  ^’-filter. 

The  estimation  of  modeling  errors  has  been  shown  to  be  particularly 
useful  in  that  this  approach  attempts,  in  some  way,  to  improve  the  dynamic 
model.  Invariably  the  chosen  structure  is  not  sufficiently  exact  to  allow 
the  filter  to  operate  to  saturation.  Thus,  some  model  error  compensation 
technique  must  still  be  employed,  e.g.,  state  noise  covariance.  In  this 
regard,  the  sequential  state  noise  covariance  estimator  has  been  shown  to  be 
effective. 
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The  rendezvous  problem  investigated  here  involves  a  single  state 
variable,  and  as  such,  the  information  and  comparisons  presented  must  be 
accepted  in  this  light.  When  the  algorithms  are  extended  to  state  vectors 
of  more  than  one  variable,  some  of  the  algorithms  lose  their  similarities. 

In  particular,  the  modified  gain  scaling,  additive  gain  term,  and  state  noise 
covariance  algorithms  in  both  their  adaptive  and  non-adaptive  forms  produce 
results  which  are  generally  different  from  one  another,  further,  da  winski's 
adaptive  c, .’-estimator  requires  a  pseudo- inverse  in  computing  a  -matrix,  this 
disadvantage  is  not  suffered  by  the  sequential  i.’-es  lima  tor.  1  he  limited  memory 
filter  is,  in  an  a  po&ti'Hiofii  sense,  an  optimal  estimator.  However,  applied 
to  the  vector  case,  it  requires  considerable  numerical  computation,  lo  in¬ 
vestigate  these  algorithms  as  applied  to  the  vector  case  of  more  than  one 
variable  is  an  undertaking  of  considerably  greater  scope.  Further,  the 
insight  provided  by  the  closed-form  expressions  for  many  of  the  algorithms 
would  be  difficult,  if  not  impractical,  to  obtain  because  of  the  greater 
problem  complexity. 

Based  in  general  upon  the  results  obtained  in  this  chapter,  the  model 
error  estimation  technique  and  the  sequential  state  noise  covariance  estimator 
have  been  selected  for  application  to  a  multi-element  state  vector  problem. 

The  choice  of  this  approach  is  due  to  the  inherent  potential  for  effective 
application  to  more  complex  problems.  As  indicated  in  Chapter  1,  the  problem 
is  the  orbit  determination  of  a  low  thrust  space  vehicle  which  is  subject  to 
thrusting  errors.  This  investigation  is  carried  out  in  the  next  chapter. 


Chapter  5 


APPLICATIONS:  LOW  THRUST  VEHICLE 
ORBIT  DETERMINATION 


5.1  Introduction 

This  chapter  is  concerned  with  the  problem  of  estimating  the  state 
of  a  continuously  thrusting,  solar  electric  propulsion  (SEP)  space  vehicle. 

The  problem  is  compounded  by  the  fact  that  the  propulsion  system  is  subject 
to  certain  mechanization  and  control  errors.  While  errors  arise  from  various 
other  sources  (49),  for  the  SEP  vehicle,  the  primary  errors  are  due  to 
anomalies  in  the  propulsion  system.  Our  aim  here  is  to  investigate  the 
practicality  of  estimating  not  only  the  vehicle  state,  but  the  corresponding 
thrust  acceleration  errors  as  well. 

A  number  of  investigators  have  considered  low  thrust  SEP  missions. 

In  (50),  Rourke  and  Jordan  investigated  guidance  and  navigation  approaches 
for  two  SEP  interplanetary  missions,  although  model  errors  were  not  estimated. 
Russell  and  Curkendall  (36)  obtained  effective  results  by  using  piecewise 
constant  functions  to  model  acceleration  errors.  Tapley  and  Hagar  investi¬ 
gated  the  estimation  of  acceleration  errors  (34)  as  well  as  inertial  measuring 
unit  errors  (35)  for  an  SEP  vehicle.  These  approaches  employed  Kalman  filter¬ 
ing  utilizing  the  differential  equation  for  /  .  Errors  were  successfully 
modeled  as  first  and  second  order  Gauss-Markov  processes,  although  Earth 
rotational  dynamics  were  not  considered.  Carpenter  and  Pitkin  (37)  investi¬ 
gated  orbit  determination  for  an  SEP  vehicle.  Here  the  total  thrust 
acceleration  was  assumed  to  be  unknown,  but  approximated  as  the  solution 
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to  a  set  of  uncoupled  linear  differential  equations.  However,  the  approach 

followed  a  least  squares  linearized  simultaneous  solution  with  no  statistical 
measures  being  employed. 

As  indicated  in  the  previous  chapter,  our  approach  here  is  to  assume 
a  structure  for  the  acceleration  errors.  Then,  employing  the  selected  filter¬ 
ing  equations,  the  parameters  associated  with  the  assumed  model  error  structure, 
as  well  as  the  position  and  velocity  of  the  vehicle  are  to  be  estimated. 

In  the  following  sections,  the  specific  dynamics  and  estimation 
equations  are  developed.  Five  different  model  structures  are  investigated 
as  to  their  ability  to  represent  the  acceleration  errors.  Each  is  developed 
separately,  and  numerical  simulations  are  carried  out  to  obtain  comparative 
estimation  performance.  Corresponding  to  each  of  these  structures,  purely 
random  errors  are  also  assumed  to  be  present.  Associated  covariances  are 
either  input,  or  estimated  using  the  sequential  (.'-estimator. 

5.2  Problem  Description 

In  the  problem  considered,  the  motion  of  the  solar  electric  spacecraft 
is  assumed  to  be  influenced  by  random  errors  in  the  thrust  acceleration  vector. 
The  nominal  SEP  mission  simulation  is  initiated  at  escape  from  the  Earth's 
sphere  of  influence  and  terminates  with  a  flyby  of  the  asteroid  Eros.  En¬ 
counter  with  Eros  occurs  at  a  distance  of  1.45  astronomical  units  (a.u.), 

152  days  after  heliocentric  injection. 

If  the  only  central  force  attraction  considered  is  the  sun,  the 
equations  of  motion  for  the  SEP  spacecraft  are 
p  =  o 

v  -  ■  i*  +  t  i . .  :  •’ 

U.I  3 
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where,  as  shown  In  Figure  5.1,  r  is  a  3- vector  of  heliocentric  position  com¬ 
ponents,  Xt  y,  z;  y  is  a  3-vector  of  heliocentric  velocity  component'  y, 

Z;  |r|  is  the  magnitude  of  r;  and  y  is  the  gravitational  parameter  of  the  sun. 
T  is  the  heliocentric  thrust  acceleration  vector  composed  of  the  design  thrust 
acceleration,  r*,  as  well  as  thrust  acceleration  errors,  r;( r ' ,  from  a  number 
of  sources  (beam  voltage  and  current,  grid  warpage,  deadband  control  errors, 
etc.).  The  heliocentric  components  of  T,  [T  T  T  ]\  may  be  expressed  in 

X  X  Z 

a  vehicle  centered,  orbit  frame  as  [ T  T  T  ]T,  where  the  two  vectors  are 

X  Y  Z 

related  by 
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where  ip  is  the  heliocentric  orientation  angle  (see  Figure  5.1).  The  two 
reference  frames  are  oriented  such  that  the  z  and  a  axes  are  parallel;  the 
X  and  x  axes  form  the  angle,  ip,  as  do  the  Y  and  ?/  axes,  with 

cos  ip  =  X/(X2  +  Y2)'*  ,  sin  \p  =  Y/(XZ  +  Y2)'3 

5.3  Acceleration  Error  Simulation 

The  SEP  spacecraft  is  driven  by  an  electric  engine  which  in  turn 
obtains  its  power  from  solar  energy  conversion  devices,  i.e.,  solar  cells. 

While  the  actual  solar  flux  density  follows  the  inverse-square  law,  for 
outbound  missions  the  actual  available  thruster  power  varies  as  j*1,7.  This 
is  due  to  improved  efficiency  of  the  solar  cells  at  lower  temperatures  (50,  2). 
In  addition  to  the  thruster  power  variations,  the  propellant  mass  flow  rate, 

/*,  affects  the  magnitude  of  the  thrust  acceleration.  In  particular  the 
magnitude,  a,  of  the  thrust  acceleration,  is  given  (50,  3)  as 


Figure  5.1  Reference  Frarres 
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where  I  Is  the  engine  specific  impulse,  Is  sea  level  Earth  qravitational 
sp 

acceleration,  and  M  is  the  instantaneous  spacecraft  mass.  These  quantities  in 
turn  are  functions  of  numerous  propulsion  system  parameters,  all  with  various 
uncertainties.  Some  of  these  have  been  mentioned  --  beam  voltage  and  current, 
grid  warpage,  and  deadband  control  errors. 

Regardless  of  the  sources  of  error,  their  effect  is  to  produce  un¬ 
certainties  in  the  thrust  acceleration  program.  For  the  orbit  determination 
function,  we  are  interested  in  the  errors  at  this  total  level,  and  it  is  at 
this  level  we  propose  to  account  for  them.  Now  since  we  are  not  concerned 
here  with  the  guidance  problem,  the  design  thrust  proqram  is  arbitrary.  Hence, 
for  simulation  purposes,  a  constant  design  thrust  acceleration  magnitude  is 
selected.  Further,  the  proqram  is  such  that  the  nominal  thrust  acceleration 
vector,  t* ,  is  oriented  colinear  with  the  v-axis  of  the  orbital  frame.  The 
true  thrust  acceleration  vector,  ",  can  be  resolved  into  its  orbit  frame 
components,  'l'x ,  V’  ,  and  V'z,  in  terms  of  its  magnitude,  and  the  clock  and 
cone  angles,  0  and  y,  respectively  (see  Fiqure  5.1).  Thus, 


~T 

sin  Y 

cos  G 

!  x 
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=  a 

COS  Y 
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sin  y 
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_  m 

and 

a  =  a*  +  5a  if. f. f 1 

where  a*  Is  the  nominal  constant  thrust  acceleration  magnitude  and  is 
associated  error.  With  zero  error,  the  thrust  acceleration  is  nominal, 
this  case, 
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In  the  presence  of  thrusting  errors,  y,  0,  and  6a  are  non-zero  thus  producing 

an  off-nominal  thrust  acceleration  vector. 

The  acceleration  error  magnitude  is  simulated  usinq 

6a  -  6a.  sin  ut  +  u  (6.3.6) 

o  a 


where  <5a„  and  w  are  constants  and  where  u  Is  a  random  variable  with  the 
0  a 

statistics 

E{u  }  =  0  ,  E{u  2 }  =  o  2  (6.3.0 

<3  3  <3 

In  the  error  simulation  the  instantaneous  values  of  the  pointing  angles,  ^ 
and  0,  are  assumed  to  be  related  as  shown  in  Figure  5.2.  This  figure  shows 
the  x-z  plane  of  the  orbital  frame.  Assuming  the  cone  angle,  >,  is  small, 
the  radius  of  the  circle  is  the  maximum  deviation,  sin  ”  =■  y ,  of  the  normal¬ 
ized  thrust  vector*  from  its  nominal  position  co-aligned  with  the  y-axis. 
Next,  the  quantity 

d  =  s( t  -  t  )  (£.3.?) 

D 

Is  the  distance  that  the  tip  of  the  normalized  thrust  vector  has  moved  since 
it  last  touched  the  boundary  given  by  the  circle  of  radius  y.  The  rate, 
is  simulated  as  a  constant  plus  an  additive  noise  component  obtained  from  a 
normal  distribution,  u(0.  o  2) ;  t  is  the  current  mission  time,  and  r.  is  the 

S  V 

time  the  boundary  circle  was  last  touched  by  the  normalized  thrust  vector. 

The  angle,  <f>,  is  obtained  by  sampling  from  a  uniform  distribution,  ."i k :  1 . 

V 

With  this  information  the  instantaneous  values  of  y  and  0  can  be  obtained  at 
each  point  in  time.  To  clarify  the  process,  the  simulation  looic  is  given 


*  That  is,  a  unit  vector  in  the  direction  of  the  thrust  vector. 
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In  Figure  5.3.  Assume  the  initial  values  of  >,  .|»,  »?,  r  ,  0,  and  o  r  and  o.* 

0  S  if' 

are  given.  To  determine  the  values  of  y  and  o  at  /  .  the  distance.  .  is 

A  \ 

first  computed.  Using  the  law  of  cosines,  the  angles  >A  and  ;  are  deter¬ 
mined.  Then  6  is  computed  by  adding  or  subtracting  z  according  to  the  sign 

A  A 

of  <J>.  Next  a  test  is  made  to  determine  if  the  computed  >  lies  within  the 

/V 

boundary  circle.  If  it  does,  then  sample  values  from  the  normal  distributions, 
N(Ot  oy2;  and  N( ot  oQ2)  are  added  to  ya  and  8^  respectively.  If  the  test  is 
not  passed,  then  new  values  of  6,  4>,  s,  and  t  are  found.  Normally  distributed 
noise  is  then  added  to  y  and  0A  as  before.  For  each  point  in  mission  simu¬ 
lation  time,  the  sequence  is  repeated  to  obtain  appropriate  simulation  values 
for  the  pointing  anqles. 

5.4  Observation  Geometry  and  Equations 

Two  observation  types  are  employed  for  estimating  the  state  vector. 
These  are  the  radar-measured  range  rate,  p,  of  the  vehicle,  and  the  angle, 

C,  fonned  by  the  lines-of-sight  to  the  Earth  and  to  a  specified  navigation 
star  (see  Figure  5.4).  In  computing  range-rate,  the  motion  of  the  tracking 
station,  due  to  Earth  rotation*  as  well  as  orbital  revolution,  is  taken  into 
account. 

Consider  Figure  5.4.  The  range  vector  can  be  expressed  as 

p  =■  r  -  r  -  v  -  R  -  h  if.-:'.  " 

s  s 

where  r*  is  the  heliocentric  position  vector  of  the  vehicle;  is  the  helio¬ 
centric  position  vector  of  the  tracking  station;  fi  is  the  geocentric  position 
vector  of  the  station;  and  r  is  the  heliocentric  position  vector  of  the 

C 

Earth.  In  the  figure,  the  reference  frame  X'Y'Z'  is  a  geocentric  frame 

*  The  diurnal  effects  of  station  motion  have  been  shown  to  provide  a 
significant  contribution  in  obtaining  accurate  orbit  determination  (51,  34). 
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aligned  with  xtz.  The  X  and  A'  axes  are  assumed  to  be  aligned,  pointing  to 
the  First  Point  of  Aires,  T.  x"Y"Z"  Is  a  geocentric  frame  whose  x"-Y"  plane 
contains  the  equator.  Thus,  x"Y"Z"  is  rotated  about  a'  through  the  angle, 
e,  the  obliquity  of  the  ecliptic  (c  =  23h°).  The  heliocentric  components  of 


R  are  simply 


QOS  s 


ffe  -  y«  "  I*.  I  sin  « 


(  «'  •  *:  •  .  • 


where  \u  |  is  the  magnitude  of  /?  ,  and  5  is  the  Earth's  heliocentric  orienta¬ 
tion  anqle.  The  heliocentric  components  of  R  are  A  ,  \  ,  and  ;  thev  are 
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The  anqle,  <J>  ,  is  the  qeographic  latitude  of  the  tracking  station,  and  '  is 
the  right  ascension  of  the  station. 

Now  taking  the  derivative  of  (5.4.1)  with  respect  to  time  gives 

"  •  •  •  *  * 
p  =  r  -  r  =  r  -  R  -  R 
s  s  e 


=  r  -  R  -  x  R  -  R  -  w  x  R 
s  s  s  e  e  e 
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where  (  )  is  the  relative  derivative,  ftg  is  the  angular  velocity  of  the 
station,  and  is  the  orbital  velocity  of  the  Earth.  To  simplify  the  com¬ 
putational  process,  the  assumption  is  made  that  the  Earth's  orbit  is  circular 
and  hence  we  and  Rc  are  constant.  The  angular  velocity,  ft  ,  is  composed  of 

the  sidereal  rotation  rate,  w  ,  and  w  : 

s  e 

ft  =  w  +  lx) 
s  s  e 

*  E  0)  "  +  w  (5.4.  ?> 

s  o 


a)  "  is  aligned  with  z"  and  w  is  aligned  with  z.  Thus 
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with  -  |a,  |  t  and  i,  =  IcoJ  t.  Thus  the  desired  range  rate  may  be  found 
as  the  magnitude,  p,  of  (5.4.1). 

The  star-vehicle-Earth  angle  is  measured  on  board  the  spacecraft.  It 
is  found  simply  as 


For  each  simulated  observation,  random  noise,  ;<  ,  is  added  to  the 
deterministic  value  by  sainplinq  from  a  normal  distribution.  If  the  discrete 
observations  at  time  t.  are  specified  qenerically  as  the  associated  statis¬ 
tics  for  .  are 


"''W  ’  0 


o.,.}  =  h‘.  iS .  . 
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5.5  Error  Compensation  Models 

As  indicated  in  section  5.2,  the  simulated  thrust  acceleration  is 
composed  of  a  nominal  or  pronramined  thrust  acceleration,  plus  an  error 
component,  m(t).  The  nominal  dynamic  model  assumed  for  the  estimation  process 
is  essentially  the  same  as  equation  ((>.:'.!).  Of  course,  the  accelerating 
error  vector,  w(t)%  is  unknown.  We  assume  an  approximation,  n'M,  to  *m  :  > , 
where  eft)  is  modeled  as  a  stochastic  process  with  Wr’l  satisfying  one  of 
several  possible  first  or  second  order  differential  equations.  The  elements 
of  e(t)  are  the  three  orthogonal  components  of  the  acceleration  error,  and 
are  expressed  in  the  orbital  frame. 

While  the  general  form  given  by  equation  (?.-}.v)  might  be  employed  to 
obtain  a  single  representation  for  the  error,  the  selection  and  use  of 
specific  models  offers  the  opportunity  to  investigate  the  effects  of  different 
structures,  i.e.,  itAuctunat  iemitivittj.  For  this  reason  five  different 
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models  are  employed:  three  basic  forms,  two  of  which  have  two  separate 
configurations.  These  are  described  in  the  following  paragraphs. 

Model  0.  In  this  model  the  acceleration  error  components  are  assumed 
to  be  represented  as  purely  random  process  noise.  Thus  only  a  selected  state 
noise  covariance  matrix  is  employed  to  maintain  a  positive  definite  error 
covariance,  and  hence  filter  operation,  and  to  prevent  divergence.  In  this 
case  we  define  the  estimated  state  vector  to  be  \  =  [rj  <»r],  and  the 
corresponding  differential  eguations  are 
r  =  v 


V  - - —  v  +  T*  +  M  u  f  ■  .f.  /" 

M3 

M  is  given  by  (!,. and  u  is  random  state  noise  representing  the  accelera¬ 
tion  errors.  The  a  ptivii  statistics  for  u  are 

/•/{«}  -  o  ,  h:\iijU)  u;(i))  -  ,.(!)  {>(!-[' 
where  /  i  (.r,  y, 

Model  1.  The  thrust  acceleration  error  component  along  the  orbit 
frame  y-axis  (in  the  direction  of  the  nominal  thrust)  is  approximated  by  a 
first,  order  Markov  process;  the  .<•-  and  components,  in  the  orbit  frame,  are 
assumed  to  be  purely  random  processes.  Here  the  nominal  differential  eguations 
are 
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with  the  addition  of  one  of  the  following  two  configurations: 
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The  random  variables  u  ,  u  ,  u  ,  and  u  have  the  statistics 

X  Z  i?  u 

E{u^)  =  0  ,  F,{u^(i)  u^(x)}  -  q,(t)  S(l-x) 

where  l  c{x,  z3  e,  a).  For  each  of  the  configurations  a)  or  b)  above,  the 
corresponding  state  vectors  to  be  estimated  are 
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respectively. 


Model  2.  The  thrust  acceleration  error  component  along  the  orbit 
frame  y-axis  is  approximated  by  a  second  order  Markov  process.  The  and  - 
components  are  again  assumed  to  be  purely  random  elements.  The  corresponding 
nominal  differential  equations  are  then 
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olus  one  of  the  following  configurations: 

d )  CJ  —  ti 

y  g 

.  • 

b)  q  -  -Be  +  u 
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8  '  "6 

The  random  variables  u  ,  u  ,  u  ,  and  n„  have  the  a  ;.’Uoy<  statistics 

x  z  g  6 

E{Uj)  =  0  ,  E{uj(t)  u^(j)}  =  q-j(t)  6(t- t) 

where  l  s,  a,  flh  For  each  of  these  above  configurations  the  correspond¬ 
ing  state  vectors  to  be  estimated  are 
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respectively. 

In  the  ensuinq  discussions  the  particular  models  and  Mato  vectors 
will  be  referred  to  often  by  their  model  number  and  letti?r  cnnfi miration. 

Thus  model  2b  is  that  described  by  model  2,  equation  and  con f inurn 

tion  b),  equation  i5.:>.io). 

Note  that  each  of  the  above  model  forms  seeks  to  approximate  i  ’  in 
its  own  way.  Model  0,  of  course,  has  a  null  structure  as  defined  in  section 
3.3.  Model  la  approximates  m(t)  by  a  sequence  of  constants  (Finure  5.5a); 
model  lb  uses  a  sequence  of  exponentials  (Figure  5.5b).  Model  2a  employs 
straight  line  seqments  (Figure  5.5c),  and  2b  approximates  >•:(?)  by  a  sequence 
of  arcs  corresponding  to  the  output  of  the  simple  harmonic  oscillator. 

For  any  of  the  assumed  models,  the  state  vector  differential  equation 
can  be  written  in  the  qeneral  form 

yj t)  =  F( x,  t)  i'i'.f.  : ' ' 

In  view  of  (5.2.1),  F(y,  t)  is  a  nonlinear  vector  function;  hence  a  suitable 
estimation  procedure  is  extended  form  of  Algorithm  I.  This  is  basically  the 
same  as  Algorithm  II,  except  that  here  the  integral  form  for  r  is  replaced 
with  the  discrete  form  given  by  (1.2.17). 

In  forming  p,  given  by  (1.2.17),  the  term  rcrT  is  required.  Two 
methods  for  determining  this  are  given  by  equations  (1.2. SO,  (1.2.23a),  and 
(1.2.55a).  The  latter  form. 
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Figure  5.5  Approximation  >-ls 
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is  the  more  correct  form,  and  is  used  exclusively  when  deal i mi  with  constant 
input  values  for  u  &(t- r)  -  E{u(t)  u(x)}.  In  this  case  the  elements  of  are 

the  terms  where  l  is  taken  from  one  of  the  index  sets  associated  with 

Model  0,  1,  or  2.  Numerical  computation  of  ivrr  is  performed  as  a  simple 
quadrature  based  on  the  mean  value  theorem  of  integral  calculus: 

T 


r<?rT  =  (B  U  B  T  +  $  B  U  B  r  T)  — 

*  k  k  k,k-l  k-1  k-1  k,k-l 


.'...5.  l: 


For  the  cases  where  the  sequential  1  estimation  algorithm  is  employed, 
equation  ( l.'.‘...Y6a)  is  used  with  r  given  by  (i.r.  : 


/* 

J  l 

k-1 


n  ,h 


1 1 


This  fonn  is  required  because  the  r  matrix  itself  is  used.  In  tliis  case  wo 
let  U  =  Qt  and  the  elements  are  again  the  terms  with  /  taken  from  the 

*  i 

appropriate  index  set.  The  numerical  computation  of  I  is  carried  in  a  manner 
similar  to  the  above,  again  using  the  mean  value  theorem: 

r  =  (b.+  .  ,5.  j 

/C  A  J  A"i  Cj 

The  elements  of  B  for  all  models  are  given  in  Appendix  B. 
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5.6  Simulation  Process 

Figure  5.6  is  a  functional  flow  diagram  of  the  simulation  process. 

It  represents  the  logic  followed  by  the  simulation  computer  program.  Input 
data  is  read  from  an  appropriate  input  file,  and  various  problem  parameters 
and  logic  switches  are  initialized.  Both  the  simulated  and  nominal  trajec¬ 
tories  are  numerically  integrated  simultaneously  through  one  observation 
interval.  The  simulated  thrust  acceleration  is  computed  according  to  Figure 
5.3,  and  the  nominal  thrust  acceleration  computed  from  one  of  the  appropriate 
error  models  of  equations  (5.5.1)  through  (5.5.10).  In  addition,  the 
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differential  equation  for  the  state  transition  matrix, 

*(t,  tj)  *  Aft)  9(t,  tj)  (1.2.29) 

is  simultaneously  Integrated.  As  discussed  in  section  1.2,  the  matrix,  A  ,  is 
the  partial  derivative  of  F  with  respect  to  the  state,  x-  The  elements  of  « 
for  each  model  are  given  in  Appendix  B. 

Next  the  first  simulated  observation  is  computed  with  a  random  noise 
component  being  added.  (Observations  are  processed  one  at  a  time;  hence,  we 
have  the  computational  advantage  of  scalar  observation.)  The  corresponding 
nominal  observation  value  is  determined,  and  the  observation  mapping  matrix, 

//,  evaluated.  The  elements  of  H  for  each  observation  are  given  in  Appendix  C. 

The  estimation  equations  are  employed  to  obtain  the  state  estimate,  \. 
If  any  further  observations  are  to  be  processed  at  this  time  point,  the  appro¬ 
priate  logic  is  repeated  as  shown  in  the  figure. 

A  test  is  made  to  determine  if  print  output  is  required,  and  if  so, 
the  appropriate  information  is  written.  A  test  is  also  made  to  determine  if 
the  final  simulation  time  has  been  reached.  If  it  has  not,  the  process  of 
integration,  observation,  and  estimation  is  repeated  for  the  next  and  succeed¬ 
ing  observation  intervals  until  the  final  time  is  reached.  Finally,  data 
plotting  is  accomplished  as  determined  from  input  data  and  the  simulation  run 
terminates. 

5.7  Numerical  Results 

In  performing  the  various  numerical  simulations,  a  common  set  of  basic 
problem  data  is  consistently  used.  This  approach  provides  a  common  basis  for 
evaluating  the  estimation  performance  of  each  of  the  approximating  models. 

This  set  of  data  is  given  in  Table  5.1.  The  initial  conditions,  position  and 
velocity,  are  the  same  for  both  the  simulated  and  nominal  trajectories.  The 


Initial  Conditions  Initial  Uncertainties  Thrust  Acceleration  Error  Simulation 
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observation  interval  is  a  constant  50  minutes  for  each  observation  type; 
hence  the  corresponding  variances  are  scaled  to  agree  with  this  sampling 
rate.  The  initial  geometry  is  such  that  the  Greenwich  Meridian  intersects 
the  X-axis  at  midnight,  GMT*,  forming  the  spacecraft  initial  subpoint 
location. 

In  order  to  gain  an  idea  of  the  acceleration  error  components,  the 
plots  shown  in  Fiqures  5.7  and  5.8  were  obtained.  Figure  5.7  shows  the  error 
components  alonq  each  of  the  orbital  frame  axes  as  functions  of  time.  Note 
the  periodicity  of  the  y-component  and  the  irregularity  of  the  x-  and  z- 
components.  Further,  the  y-component  is  approximately  three  times  as  great 
as  the  others.  Fiqure  5.8  shows  the  trace  of  the  thrust  acceleration  vector 
projected  on  the  orbital  frame  x-z  plane.  Note  the  approximately  circular 
bound  of  radius  =  .onzs  mn/scv2 .  The  errors  are  also  seen  to  be  somewhat 
concentrated  in  the  first  quadrant,  and  rather  less  dense  in  the  fourth 
quadrant.  As  the  simulation  time  continues  beyond  the  35  days  shown  here, 
one  can  expect  that  these  errors  would  be  more  uniformly  distributed  within 
the  full  region. 

In  the  following  paragraphs  numerical  results  are  presented  showing 
estimation  performance  for  the  various  models  and  approaches.  In  presenting 
the  data,  two  quantities  are  plotted  which  represent  a  fiqure  of  merit  of 
performance.  These  are  the  Euclidean  norms  of  the  error  components  of  posi¬ 
tion  and  velocity,  and  the  square  root  oi  the  trace  of  the  appropriate 
covariance  submatrix  elements.  The  ensuing  discussions  will  refer  to  these 
quantities  as  RSS  (root-sum-square)  and  RTC  ( root-trace-covariance) , 
respectively. 


*  GMT  ■  Greenwich  Mean  Time;  see,  for  example,  (9). 
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Figure  5.7  Acceleration  Error  Components 
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Figure  5.8  Acceleration  Errors  in  x-s  Plane  (mm/sec2) 
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Nominal  Cases  With  and  Without  Model  Error.  Figures  5.9  through  5.12 
show  the  position  and  velocity  error  norm,  RSS,  for  the  nominal  data  of  Table 
5.1,  both  with  and  without  thrust  acceleration  modeling  error,  respectively. 
Clearly  the  estimation  performance  in  Figures  5.9  and  5.10  is  erratic  and 
unacceptable,  producing  estimation  errors  as  high  as  20,000  km  and  18  m/sec. 
The  rapid  decrease  in  the  error  covariances  is  illustrated  bv  the  short  dashed 
line,  the  RTC,  near  the  oriqin.  Figures  5.11  and  5.12  illustrate  the  case  of 
perfect  inodelinq.  Here  both  the  actual  and  assumed  thrust  accelerations  are 
of  constant  magnitude,  always  directed  along  the  orbital  frame  v  axis.  Iwo 
RSS  curves  are  shown  in  each  figure,  corresponding  to  two  different  values 
of  the  observation  error  variance  for  the  star-Earth  angle,  In  both  cases 
the  RSS  values  become  substantially  less  than  2  km  in  position  and  .004  m/sec 
In  velocity  as  the  estimation  process  proceeds. 

Perhaps  more  interesting  is  the  sensitivity,  shown  by  these  curves, 
of  the  estimation  process  to  differences  In  the  observation  error  variance. 

The  solid  line  in  Figures  5.11  and  5.12  represents  the  RSS  for  the  nominal 
(Table  5.1)  value  of  R  =  (v  arose o)?~.  The  dashed  line  corresponds  to  an 

v» 

increase  of  3  arcsec  resulting  in  Rf  --  (10  arosoo)2.  In  both  cases,  of  course, 
each  value  Is  used  for  both  the  true  (simulated)  and  assumed  (nominal)  value. 
This  change  In  the  value  of  R ^  illustrates  a  somewhat  surprlsinq  sensitivity 
of  the  estimation  accuracy  to  this  particular  measurement  type.  For  the 
Increased  r ^  the  position  RSS  is  more  than  twice  that  for  the  nominal  value. 
Clearly  similar  behavior  is  seen  for  the  corresponding  velocity  RSS. 

The  sensitivity  of  the  estimation  process  to  R and  the  problems  of 
actually  developing  equipment  capable  of  such  high  accuracies  prompts  the 
question  of  necessity.  Are  £  measurements  Indeed  necessary?  To  answer  this, 
consider  the  Information  obtained  from  a  single  range-rate  observation.  From 
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Figure  5.9  Position  Error,  No  Model  Error  Compensation 
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riqure  5.10  Velocity  Error,  No  Model  Error  Compensation 


Figure  5.12  Velocity  RSS  for  Nominal  and  Increased 
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equation  (1.3.21),  letting  j  =  k-1,  we  first  make  the  approximation  that 
♦..  *  l  for  small  A t,  so  that 

Dropping  the  subscripts  and  employing  the  p  elements  of  //  defined  In  Appendix 
C,  we  have  for  the  i  -  j  elements  corresponding  to  position  ft,  j  *  j,  2,  5;, 


[ffr  if1  =  [(xi  -  -  f^  -  xsi)(i>/p)]  • 

[fh  -  xsj)  -  (xj  -  ysyf(5/p;j/fp2  m 


(5.7.1) 


where  p  Is  the  range  from  the  tracking  station  to  the  vehicle.  xi  and  Xgi 
are  elements  of  r  and  v  respectively.  For  the  velocity  elements  ft,  j  - 

S 


4t  5,  <?;, 

[ht  if1  H].t  =  (x.  -  x  .)(x.  -  x  j/fp2  ffi.7.r; 

1  SI  J  SJ 

and  for  the  mixed  terms  ft  =  1,  2,  3 ;  j  -  <?,  5,  6) 

[Ht  R'1  U)ij  =  [(X.  -  Xs.)  -  (X.  -  Jfp/p>]  •  (Xj  -  Xsj)/(p2  K) 

(5.7.3) 


Now  consider  the  situation  where  the  vehicle  and  tracking  station  are  In  the 
ecliptic  (heliocentric  X-Y)  plane,  with  no  velocity  components  directed  out 
of  the  plane.  Clearly  the  information  derivable  from  the  range-rate  measure¬ 
ment  for  the  Z-component  (t  =  3)  of  position  and  velocity  Is  zero.  Now  while 
this  configuration  is  not  usually  the  case,  the  values  of  the  position  and 
velocity  7-components  of  both  the  vehicle  and  the  station  are  small  enough  to 
make  the  information  quite  small.  Thus  the  range-rate  measurement  contributes 
only  little  information  for  the  estimation  of  the  out-of-plane  components  of 
the  state.  (Note  that  the  Information  would  be  even  less  if  station  motion 
were  not  considered;  this  further  substantiates  the  claim  set  forth  in 
(51,  34).) 
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To  see  how  the  angle  measurement,  alleviates  this  problem,  consider 
the  corresponding  elements  of  the  Information  matrix,  again  for  one  measure¬ 
ment  and  with  the  approximation,  $  »  I.  For  the  position  elements  (i,  j  * 
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Here,  even  when  the  Z-component  of  position  is  zero,  the  corresponding  infor¬ 
mation  term  is,  in  general,  non- zero.  Only  when 


Is  the  information  term  zero.  Now  Z  is  zero,  since  the  Earth  remains  in  the 

e 

ecliptic,  and  z  is  very  small  compared  to  | R  -  r|.  Hence  the  approximate 
value  of  5  =  90°  results  In  a  corresponding  value  of  zero  for  the  information 
term.  This  value  can,  of  course,  be  encountered  in  practice.  However,  for 
the  periods  Investigated  In  the  simulation,  this  particular  geometrical  con¬ 
figuration  was  not  encountered.  (In  passing  we  note  that  the  values,  £  =  0,  n, 
could  lead  to  some  numerical  problems  due  to  the  division  by  sin  £.  However, 
for  the  navigation  star  selected  these  values  of  S  cannot  occur.) 

For  the  velocity  elements,  the  information  term,  htr~1h ,  is  zero  since 
the  corresponding  elements  of  H  are  zero.  Thus  estimates  of  the  Z-component 
of  velocity  must  rely  almost  solely  upon  the  Information  derived  from  the 
range  rate.  This  dependence  Is  manifested  by  errors  In  the  out-of-plane 
velocity  which  are  sometimes  as  much  as  an  order  of  magnitude  greater  than 
those  of  the  other  components.  However,  this  particular  problem  is  partially 
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alleviated  by  accurate  estimation  of  the  position.  This  is  true  because  the 
acceleration  is,  in  part,  a  function  of  position,  i.e.,  equation  1) . 

Thus  accurate  estimates  of  r  keep  the  nominal  velocity  values  fairly  accurate. 
More  directly,  the  elements  of  the  state  transition  matrix,  previously  assumed 
to  be  diagonal,  in  fact  contribute  a  coupling  of  information  elements  to  aid 
accurate  estimates  of  the  velocity. 

While  the  previous  discussion  has  been  directed  toward  the  Information 
obtainable  for  the  Z-components  of  the  state,  very  similar  conditions  and 
explanations  apply  for  cases  where  the  vehicle  is  on  or  very  near  the  X-  or 
Y-axes.  For  example,  initially  the  vehicle  is  on  the  heliocentric  X-axis. 

Only  one  station,  Madrid  (JPL-61),  is  initially  tracking.  The  vehicle  is 
almost  directly  overhead,  hence,  p  -  0,  and  (Y  -  Ys)  =  0.  Thus,  only  a  small 
amount  of  information  about  Y  is  available  since  Dp/DY  =  0.  Similarly,  for 
these  conditions,  X  -  X  ~  0,  and  hence  Dp/3,Y  =  0  yielding  little  information 
about  X.  However,  information  is  available  from  the  star-vehi  cl  e-Earth  angle, 
C,  which  does  aid  the  estimation  of  these  otherwise  locally*  unobservable  or 
nearly  unobservable  elements.  In  fact,  it  has  been  found  that  conditions  such 
as  these  occur  quite  often  at  various  times  throughout  the  mission.  The  result 
is  that  without  the  angle  measurement,  f.,  extremely  large  estimate  error 
values  often  occur. 

Thus  in  view  of  the  foregoing  discussions,  it  is  desirable  to  have 
an  additional  or  supplemental  measurement  type.  The  onboard  angle.  ,  appears 
to  be  a  viable  candidate  producing  useful  information  to  aid  the  orbit 
determination  process.  One  of  its  chief  disadvantages  is  the  high  resolution 

*  The  term  local  observability  might  be  suggested  for  those  state 
elements  which  are  observable  for  only  a  subset  of  the  total  observation  set. 
Thus  a  system  could  be  described  as  locally  observable  if  (l.c.22)  holds  where 
the  observations  {y^  ...,  y^}  form  a  proper  subset  of  V^. 
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demanded  to  produce  accurate  estimates.  Another  dlsadvantaqe  Is  discussed 
later. 


Model  0  -  State  Noise  Covariance.  A  number  of  simulations  were  per¬ 
formed  using  various  values  of  the  diagonal  state  noise  covariance  matrix, 
u%  (see  sections  1.2  and  5.6).  Typical  results  for  three  linearly  related 
values  of  u  over  a  60-day  period  are  shown  in  Figure  5.13.  Here  the  estima¬ 
tion  performance  is  reflected  by  the  position  error  RSS  for  the  following 
values  of  u: 

ux  =  K.16  x  10~6  .166  x  7<?~5  .16  x  10~6)T  (nrn/a tv2  > 2 

U2  =  T( .  16  x  io~ 10  .166  *  10~ 5  .16  x  10~10)T  (rrr:'p,'c':  ' : 

U3  =  I(.16  x  io~ 12  x  x  10~X7)T 

Examining  the  figure,  as  the  state  noise  covariance  is  decreased,  we  begin  to 
see  divergence  of  the  state  estimate.  In  fact,  for  all  cases,  including  that 
corresponding  to  the  largest  value,  ul%  divergence  is  at  least  starting  near 
the  end  of  the  simulation  period. 

A  particularly  interesting  feature  is  the  apparent  trade-off  in  the 
values  of  u  and  the  maximum  estimation  accuracy  obtained.  This  is  shown  by 
the  large  dips  in  the  RSS  curves  at  about  10  days.  For  the  smallest  value, 
u 3,  the  RSS  curve  has  the  greatest  dip;  for  ux  the  dip  is  smallest.  An 
intuitive  explanation  of  this  phenomenon  is  based  on  the  following  reasoning. 
Initial  filter  operation  produces  estimates  of  greater  and  oreater  accuracy 
as  more  observations  are  taken.  If  the  value  of  u  is  small,  the  error  co- 
variance  decreases  rapidly,  producing  more  nearly  optimal  estimates  and  thus, 
for  a  short  time,  more  accurate  estimates.  However,  eventually  the  covariance 
becomes  so  small  that  it  is  unable  to  cope  with  the  error  buildup,  and  di¬ 
vergence  occurs.  On  the  other  hand,  for  a  larger  v  the  error  covariance  does 
not  decrease  as  rapidly.  Hence,  the  filter  is  not  operating  near  the  optimum. 
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Figure  5.13  Position  Error,  Mcdel  0 
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and  estimate  accuracy  is  therefore  not  as  good  —  at  least  during  the  short 
period  around  10  days.  However,  the  presence  of  a  larger  u  keeps  the  filter 
operating  longer  before  divergence  occurs. 

Next  note  the  dotted  curve  which  corresponds  to  the  maximum  RTC,  i.e.. 
that  for  ux.  Its  average  value  is  approximately  70-80  km,  a  value  which  does 
not  reflect  the  error  RSS  of  300-400  km.  (Although  they  are  not  shown,  the 
RTC's  corresponding  to  U2  and  u3  all  lie  below  that  for  l'x.)  This  suggests 
that  the  value  of  U  be  increased. 

Such  an  increase  produces  the  estimation  results  shown  in  Fiqures 
5.14  and  5.15.  The  increased  elements  of  u  have  the  values, 

q  =  q  =  .  16  *  10  4  (mrn/soa2  )2 

X  z 

q  =  .166  *  10~ 3  (mm/ sea)2 

y 

These  figures  show  position  and  velocity  RSS  and  RTC  curves  for  both  the 
nominal  acceleration  error  and  an  increased  error,  to  be  discussed  later. 

In  Figure  5.14  the  position  error  RSS  is  seen  to  be  greater  than  that  obtained 
in  Figure  5.13.  However,  the  RTC  curve  is  a  better  measure  of  the  accuracy, 
even  though  it  does  not  bound  the  RSS  curve.  Further,  the  estimate  remains 
fairly  stable  throughout  the  simulation  period  of  60  days.  This  is  further 
supported  by  examining  Figure  5.15.  Here  the  solid  curve  represents  the 
velocity  error  RSS  over  the  60-day  simulation  period.  Although  there  are 
some  fairly  large  error  peaks  during  the  latter  30  days,  overall  the  velocity 
error  RSS  appears  generally  to  be  free  of  divergence  during  this  period. 

Fiqures  5.14  and  5.15  also  show  the  estimation  performance  for  the 
case  where  the  thrust  acceleration  error  is  increased  by  a  certain  amount  at 
30  days  into  the  mission.  This  increase  is  produced  by  changing  to  the 
following  acceleration  error  simulation  values: 


Nominal 
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Figure  5.15  Velocity  Error,  Model  0,  Nominal  and  Increased  Acceleration  Error  at  30  Days 
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Y  =  .026175  vad 
6cip  =  .0027  m/sca2 

This  corresponds  to  a  50%  increase  in  the  nominal  values.  A  plot  of  the 
corresponding  y-component  of  thrust  acceleration  error  is  shown  in  Figure 
5.16.  Figures  5.14  and  5.15  thus  reflect  the  ability  of  the  algorithm  to 
handle  changes  in  the  thrust  acceleration  error.  For  the  corresponding  position 
error  RSS,  the  differences  between  the  nominal  and  increased  thrust  accelera¬ 
tion  error  are  essentially  nonexistent,  and  are  therefore  shown  together  as 
the  solid  curve.  The  dotted  curve  is  the  RTC  for  both  also.  In  Figure  5.15, 
however,  the  velocity  error  RSS  curves  are  slightly  different,  with  the  RSS 
for  the  increased  acceleration  being  given  by  the  dashed  curve.  As  expected, 
the  error  is  slightly  larqer  than  that  for  the  nominal  acceleration  error 
simulation.  As  in  Figure  5.14,  the  dotted  curve  represents  the  velocity  error 
RTC  for  both  cases.  However,  a  curious  aspect  is  the  behavior  of  the  RTC 
curve.  This  is  seen  as  a  generally  increasing  trend  from  about  10  days.  The 
data  also  reflect  a  similar,  but  considerably  less  detectable,  behavior  for 
the  position  RTC.  One  possible  explanation  for  this  is  the  fact  that  as  the 
vehicle  moves  farther  from  the  Earth,  the  angle  measurement,  becomes  less 
sensitive  to  changes  in  position.  For  example,  initially  the  spacecraft  is 
approximately  9.25  *  10s  km  from  the  Earth.  The  standard  deviation  of  7 
arcsec  for  the  onboard  angle  measurement  corresponds  to  an  arc  length  at  this 
distance  of  approximately  31.4  km.  By  60  days  into  the  mission,  the  spacecraft 
is  approximately  1.53  x  107  km  from  the  Earth:  the  corresponding  arc  lenqth 
is  approximately  519  km.  While  this  behavior  could  be  seen  more  rigorous lv 
in  terms  of  the  information  matrix,  the  effect  is  clear:  less  information 
is  available  from  the  onboard  measurement  angle  as  the  distance  from  the 
Earth  is  Increased. 
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Figure  5.16  Acceleration  Error  (y-a/is).  Increase 


m 

Models  1  and  2.  I  i cjures  5.17,  5.18,  <md  5.19  show  the  estimation 
performance  for  each  of  the  other  four  approximating  models.  For  these 
cases,  the  thrust  acceleration  error  parameters  are  Increased  to  the  same 
values  as  for  Model  0,  above.  Further,  each  case  employed  values  of  ^  as 
Indicated  below. 

Iq  =q  =  .16  x  20”  4  (rm/eee2)2 

*  Z 

qe=  .16  x  10~10  (rm/see3)2 


Model  lb 


q  =  q  =  .16  x  10 ”4  (rm/sea2)2 

X  z 

q  =  .5  x  10  14  (rm/sec3) 2 
e 

q  =  .q  x  10"  35  sec" 4 


Model  2a 


q  =  .16  x  io~k  (mm/aec2)2 

z 

.3  x  10~2  4  (m/eec1*)2 


Model  2b 


'g 


q  =  .16  x  io~u  (rm/oec2)2 
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20*  2  2  (rm/cec") 2 
.5  x  22“ 30  8ej~6 


Figure  5.17  shows  the  position  error  RSS  for  each  of  the  model  con¬ 
figurations.  As  expected,  the  estimation  accuracy  generally  is  improved  over 
that  for  Model  0.  Most  remarkable  is  the  accuracy  with  which  Model  2b  yields 
performance  estimates.  This  is  represented  by  the  solid  curve,  and  corresponds 
to  an  average  position  RSS  of  about  60  km  (time  s  10  days).  This  approaches 
a  factor  of  5  reduction  In  the  RSS  values  for  the  other  models.  It  is 
Important  to  remember,  however,  that  Model  2b  corresponds  to  the  correct 
structure  of  the  actual  thrust  acceleration  error  magnitude,  and  thus  one 
would  expect  superior  performance.  We  remark  that  Model  2b  is  not  quite 
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figure  5.18  Velocity  Error,  Models  1  and  2,  Increased  Acceleration  at  30  days 
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Figure  5.19  Thrust  Acceleration  Error  (y  component) 

Increased  Error  at  30  Days 
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the  exact  structure  for  the  y-component  of  acceleration  error,  however. 

This  is  due,  of  course,  to  the  fact  that  the  thrust  acceleration  error  vector 
deviates  from  its  nominal  orientation  so  that  Model  2b  is  attempting  to 
approximate  the  y-axis  projection  of  the  thrust  acceleration.  Further,  we 
have  employed  a  differential  form,  and  consequently  truncation  errors  in  the 
numerical  integration  algorithm  contribute  what  appears  to  the  estimation 
equations  to  be  a  form  of  model  error.  It  Is  these  aspects  which  require  the 
use  of  a  small  state  noise  covariance  matrix  (also  see  (11)  ). 

The  next  most  accurate  performance  is  that  provided  by  Model  2a, 
simple  linear  form.  As  shown  by  the  alternately  dot-dashed  curve,  the  average 
position  RSS  is  approximately  300  km  (for  time  >  10  days). 

The  two  first  order  models,  la  and  lb,  produced  acceptable  estimation 
accuracy,  but  not  as  good  as  expected.  In  fact.  Model  lb  produced  a  position 
RSS  curve  of  about  450  km,  some  80  km  greater  than  that  for  Model  la.  One 
possible  reason  for  this  is  that  too  small  a  value  for  one  or  a  number  of 
the  state  noise  covariance  elements  was  used.  This  suggestion  is  supported 
by  the  fact  that  the  associated  error  RTC  data  indicated  an  approximate  steady 
state  value  of  280  km,  a  value  somewhat  below  the  actual  RSS  curve.  In  all 
cases  a  considerable  number  of  runs  were  made  for  each  model  to  obtain  appro¬ 
priate  values  for  the  state  noise  covariance  matrix  elements.  This  process  is 
tedious,  time-consuming,  and  expensive,  and  for  this  reason,  it  was  decided, 
following  some  effort,  not  to  pursue  this  "tuning"  process  further.  Clearly, 
one  would  expect  that  more  accurate  results  could  be  obtained  by  varying  the 
ql  values  until  the  position  error  RTC  and  RSS  become  approximately  the  same. 

For  each  of  the  other  models,  the  indicated  values  of  q.  resulted  in 
RTC's  which  coincided  with  or  exceeded  their  respective  RSS  data.  The  model 
producing  an  error  RTC  greater  than  its  RSS  data  was  Model  2b.  In  particular 


the  RTC  data  Indicated  an  average  value  of  approximately  295  km.  Presumably 
further  tuning  of  the  filter  would  result  in  even  greater  accuracy  for  Model 
2b.  For  the  reasons  cited  above,  this  was  not  done. 

Figure  5.18  shows  the  velocity  error  RSS  curves  for  each  of  the  models. 
Here  the  performance  appears  to  be  nearly  the  same  for  each.  However,  these 
curves  are  in  general  more  erratic  than  in  Figure  5.19.  In  particular,  toward 
the  end  of  the  simulation  interval  the  effects  of  the  increased  thrust  accel¬ 
eration  error  are  rather  pronounced.  In  spite  of  this,  there  does  not  seem 
to  be  any  indication  of  a  tendency  toward  divergence;  hence,  for  all  these 
models,  the  estimates  are  stable  over  the  entire  interval.  Further,  in  all 
cases  the  corresponding  RTC  data  bounded  or  coincided  with  the  RSS  data. 

Figure  5.19  shows  the  estimates  of  the  thrust  acceleration  y-component 
for  each  model.  These  estimates  are  those  which  resulted  in  the  estimation 
performances  of  Figures  5.17  and  5.18.  Each  estimated  acceleration  curve 
represented  by  the  heavy,  solid  line  is  identified  with  the  corresponding 
model  number.  The  estimated  curves  are  superimposed  over  a  lighter  curve, 
the  true  y-component  of  thrust  acceleration  error.  As  seen  from  the  figure, 
all  four  models  perform  admirably  in  representing  this  component  of  accel¬ 
eration  error.  Particularly  notable  is  the  fact  that  all  methods  adapt  to 
the  increase  in  the  acceleration  error  at  30  days.  Now,  the  plots  for  Models 
la  and  2b  do  not  appear  to  be  very  different  (although  the  position  and 
velocity  RSS's  in  Figures  5.17  and  5.18  indicate  otherwise).  On  the  other 
hand  consider  the  respective  curves  for  Models  lb  and  2a.  Close  examination 
of  the  Model  lb  curve  reveals  that  a  number  of  the  actual  error  peaks  are  not 
matched  by  the  approximating  curve.  This  is  possibly  due  to  a  smaller  than 
necessary  state  noise  covariance  matrix  and  presumably  the  estimates  would 
be  improved  if  this  matrix  were  increased  in  value.  This  further  supports 


the  suggested  explanation  why  Model  lb  yields  the  largest  position  error  R'/> . 
Next,  examining  the  Model  2a  curve  reveals  that  the  estimated  values  near 
the  error  peaks  are  larger  than  the  actual  error  (the  breaks  in  the  approxi¬ 
mating  curve  Indicate  where  the  values  extend  outside  the  limits  of  the 
graph).  In  view  of  the  previous  suggestion  concerning  Model  lb  that  the 
state  noise  covariance  is  too  small,  the  implication  hero  is  that  the  cor¬ 
responding  covariance  is  too  large.  In  this  case,  however,  it  has  been  found 
that  the  position  error  RTC  data  coincides  very  closely  with  the  I’SS  curve. 
Further,  the  fact  that  the  Model  2b  simulation  yielded  R1C  values  which  were 
well  above  its  RSS  values,  tends  to  discount  this  explanation  for  over¬ 
estimating  the  peaks.  The  actual  reason  for  this  behavior  is  not  clear.  Two 
possible  explanations  are  offered.  First,  it  is  possible  that  generally  over¬ 
estimating  the  acceleration  error  simply  results  in  orbit  determination 
performance  which  is  more  accurate  than  that  resulting  from  under-estimating 
the  error.  Second,  the  plots  shown  are  generated  using  approximately  every 
fourteenth  or  fifteenth  point.  Thus  many  values  occur  which  are  not  shown, 
and  on  thu  autvtage,  it  is  possible  that  considerably  better  estimates  of  the 
error  are  obtained  for  Model  2a  than  for  Model  lb. 

In  view  of  the  foregoing  discussions,  it  appears  that  generally  the 
second  order  models,  2a  and  2b,  are  superior  in  representing  thrust  accel¬ 
eration  errors  of  the  type  encountered  in  the  simulations.  Further,  the 
importance  of  exact  modeling  to  the  maximum  extent  practical  is  clearly 
illustrated  by  the  exceptional  performance  of  Model  2b. 

Sequential  Estimation  of  the  State  Noise  Covariance.  We  have  seen 
rather  accurate  performance  using  an  a  pfUon.i  constant  state  noise  covariance 
matrix  associated  with  each  of  the  models.  However,  in  view  of  the  many 
numerical  simulations  which  invariably  must  be  performed  in  order  to  "tune" 
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the  state  (vector)  filter,  a  natural  alternative  Is  to  employ  the  sequential 
state  noise  covariance  or  ^-filter  to  aid  In  this  process.  This  has  been 
quite  successful  as  we  now  proceed  to  show. 

First,  some  comments  dealing  with  general  performance  are  In  order. 
Initially,  numerous  runs  were  made  to  investigate  the  effectiveness  of  the 
sequential  3-filter  In  these  cases  operation  of  both  the  ordinary  state 
filter  and  the  3-filter  was  simultaneously  initiated.  Results  were  obtained 
for  all  models  and  were  less  than  satisfactory.  In  all  of  these  cases,  the 
performance  was  worse  than  that  obtained  by  simple  tuning  of  the  state  filter 
using  a  pfUo^u  constant  values  of  Q.  Attempts  were  made  tc  improve  the 
performance  of  the  3-filter  by  employing  different  a  ptiioii  values  of  both 
3  and  the  3-error  covariance,  3;  using  different  constant  values  of  the 
observation  residual  error  variance,  r;  and  using  the  "sliding  window"  modi¬ 
fication  to  the  '/’-estimator,  equation  (3.6.36).  These  attempts  met  with  little 
success.  After  exhausting  such  approaches,  a  re-examination  of  the  strategy 
for  employing  both  the  state  and  3-fllters  was  made.  This  revealed  that 
possibly  the  initial  transient  operation  of  the  state  filter  was  having  a 
detrimental  effect  upon  the  3-filter  performance.  To  investigate  this,  runs 
were  made  in  which  in  which  initiation  of  the  3-filter  operation  was  post¬ 
poned  for  the  transient  period  of  state  filter  operation,  approximately  5 
days.  The  results  were  highly  encouraging. 

To  illustrate  this  strategy,  cases  for  Models  lb  and  2b  were  run. 

Recall  from  previous  discussion  that  the  results  obtained  in  Figures  5.17  - 
5.19  suggested  that  the  state  noise  covariance  values  appeared  to  be  too  small 
and  too  large,  respectively,  for  these  models.  For  this  reason,  it  was  felt 
that  the  3-fllter  could  improve  the  values  of  3.  and  hence  the  estimation 
performance.  For  both  models,  simulations  were  run  for  30  days  using  the 
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Initial  data  given  In  Table  5.1.  In  addition,  a  pfUofU  values  of  Q  were 
taken  to  be  those  of  u  corresponding  to  the  results  obtained  In  Figures  5.17  - 
5.13*  (see  page  198).  The  initial  values  of  the  Q  error  covariance  matrix, 
r>t  were  selected  to  make  :;0  diagonal.  The  s0  elements  corresponding  to 
diagonal  elements  of  Q  were  chosen  approximately  as  the  square  of  these  a 
puiani  Q  values.  The  S0  elements  corresponding  to  off-diagonal  or  correlation 
terms  of  Q  were  taken  (approximately)  as  the  product  of  the  a  pkivtii  diagonal 
Q  elements  corresponding  to  the  row  and  column  of  the  correlation  term.  Thus, 
at  t  =  ot 

8.,  -  q.  2  i  8..  ~  q  q  ( 5.7.1 W 

ii  j  *  kk  H mm  'nn 

Taking  this  approach  was  an  arbitrary  choice.  However,  it  was  found  that 
relatively  accurate  values  for  s0  are  necessary,  and  the  rule-of-thumb  given 
by  (5.6. S)  was  found  to  be  a  viable  approach.  In  view  of  this,  the  a  pxieni 
S  values  are  given  in  Table  5.2  for  each  of  the  two  b  models. 

Because  of  the  extremely  small  values  of  many  of  the  t* _ ,  different 
units  are  used  in  carrying  out  these  simulations,  namely,  millions  of  meters 
and  millions  of  seconds.  This  is  done  in  order  to  avoid  numerical  difficulties. 
In  order  to  test  the  adaptability  of  the  Q  filter,  increased  acceleration 
errors  were  simulated  at  15  days.  This  was  done  by  again  increasing  y  and 
6a0  by  50%  to 

y  *  .026175  rad 
6a0  =  .0027  m/sec2 

In  determining  the  observation  residual  error  variance,  T,  the  sliding  window 
modification,  equation  (3.6.36)  was  used  with  the  small  a  p/iieii  value  of 

*  Recall  the  discussion  in  section  5.5  associated  with  the  computation 
of  T.  Because  T  itself  is  required,  equation  (1.2.30)  is  used,  and  hence 
we  assume,  approximately,  Q  *  U. 
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y#  *  I0~lk  (m/8eo)k.  Twenty  residuals  were  used  as  the  residual  batch  size, 
i.e.,  N  0 ,  in  Ci.fi.  xc>) .  Finally,  in  carrying  out  the  simulations,  the 
U  filter  was  not  operated  until  the  fifth  day  of  'nission  time  in  order  to 
avoid  the  transient  effects  of  state  filter  operation. 

Figure  5.20  shows  the  position  RSS  and  RTC  for  both  b  models:  5.20a 
for  Model  lb,  and  5.20b  for  Model  2b.  In  both  cases  the  performance  is  quite 
good.  Figure  5.20a  shows  a  particularly  significant  increase  in  performance 
over  that  shown  in  Figure  5.17.  The  RSS  curve  has  been  lowered  to  an  approxi¬ 
mate  mean  of  125  km,  a  considerable  improvement  over  the  previous  450  km  of 
Figure  5.17.  Further,  the  RTC  curve  is  more  representative  of  the  actual 
RSS  error,  although  it  appears  to  be  slightly  low.  For  Model  2b  there  does 
not  seem  to  be  any  noticeable  improvement  in  the  estimate  as  reflected  in  the 
RSS  curve.  Both  here  and  in  Figure  5.17  the  RSS  has  a  mean  of  around  60  kin. 
However,  the  corresponding  position  error  RTC  is  now  an  accurate  representa¬ 
tion  of  the  position  error,  and  adequately  bounds  the  actual  error  RSS.  Note 
that  for  both  models  the  filters  easily  handle  the  50%  acceleration  error 
increase  at  15  days. 

Results  obtained  for  the  velocities  are  shown  in  Figures  5.21a  and 
5.21b.  The  performance  is  comparable  to  that  shown  in  Figure  5.18.  However, 
for  Model  lb  the  velocity  error  RSS  curve  has  some  peaks  which  are  larger 
than  those  of  Figure  5.18b.  These  peaks  seem  to  be  decreasing  in  amplitude 
after  the  largest  one  at  21  days,  and  therefore  do  not  suggest  divergence. 
However,  the  RTC  is  somewhat  lower  than  the  RSS  curve,  and  it  is  possible  that 
divergence  may  appear  at  some  later  time.  As  before,  the  results  for  Model 
2b  are  particularly  good,  although  here  also  there  is  a  large  error  peak  at 
about  21  days.  Again  the  RSS  decreases,  and,  except  for  the  peak,  the  RTC 
mean  of  about  .2  m/sec  adequately  reflects  the  true  error. 
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5.21a  Velocity  Error  RSS  and  RTC  for  Model  lb 
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5.21b  Velocity  Error  RSS  and  RTC  for  Model  2b 


Figure  5.21  Velocity  Error  RSS  and  RTC 
With  Sequential  Q  Estimation 
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It  Is  entirely  possible  that  the  peaks  In  the  error  RSS  curves  may 
be  due  to  operation  of  the  filters  as  they  attempt  to  adapt  to  the  accel¬ 
eration  error.  Figures  5.22a  and  5.22b  suggest  this  possibility  as  they  show 
the  true  and  approximated  thrust  error  accelerations.  In  particular.  Figure 
5.22a  shows  that  the  Model  lb  error  approximation  is  slightly  out  of  phase 
with  the  true  error.  Further,  it  is  smoother  and  does  not  reflect  the  more 
erratic  behavior  of  the  actual  thrust  acceleration  error.  On  the  other  hand. 
Model  2b  does  a  reasonably  good  job  of  approximating  the  true  error.  In 
either  case  there  are  rapid  alternations  at  the  true  error  maxima  and  minima 
which  are  difficult  to  approximate  and  which  are  therefore  quite  capable  of 
resulting  in  the  peaks  in  the  position  and  velocity  error  RSS'. 

Figures  5.23  and  5.24  are  plots  of  appropriate  RTC  values  for  the 
state  noise  covariances.  These  figures  illustrate  quite  well  the  expected 
changes  In  the  state  noise  covariance  values.  Figures  5.23a  and  5.23b  show 
the  respective  Model  lb  and  2a  state  noise  covariance  RTC  curves  for  the 
orbital  frame  x-z  components  of  error.  Note  that  both  curves  reflect  the 
constant  a  p>Uo>U  values.  At  5  days  there  is  a  rapid  incAcaic  in  the  RTC 
value  for  Model  lb  and  a  dectcaae  in  the  RTC  for  Model  2b.  This  behavior  is 
exactly  that  expected  to  improve  the  position  and  velocity  RSS'.  Also  notable 
is  the  Increase  in  these  Q  estimates  at  about  15  days,  corresponding  to  the 
Increased  thrust  acceleration  error.  Thus  the  Q  filter  is  actually  adapting 
its  estimates  to  account  for  these  error  increases.  This  behavior  can  also 
be  seen  in  Figures  5.24.  Figure  5.24a  shows  the  state  noise  covariance  RTC, 

,  corresponding  to  the  noise  term  in  the  approximating  differential 
equation  ( 5.5. 5 ).  Likewise,  Figure  5.24b  shows  the  covariance  RTC,  , 
corresponding  to  the  noise  term  In  (5.5.10).  We  again  see  the  constant  a 
pfUoru.  Q  values  for  both  cases  during  the  first  5  days  of  the  mission.  At 
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this  point  changes  occur  with  Increasing  and  ^  decreasing  as  expected. 
Again  at  15  days  both  estimates  begin  growing  as  the  Q  filter  attempts  to 
account  for  the  increased  model  error. 

For  the  model  parameters,  a  and  B,  the  corresponding  state  noise 
covariance  estimates  retained  their  a  ptUoni  values  with  no  discernible 
changes  as  the  estimation  proceeded.  This  is  attributed  to  the  very  small 
a  puLotu.  values  of  s10  10  corresponding  to  these  parameters  (Table  5.2). 

Recall  from  Chapter  4  that  too  large  a  value  of  Q  for  the  model  parameter, 
resulted  in  <?'s  following  the  observations,  thus  destroying  the  correlation 
effects.  In  view  of  this,  the  suggestion  is  made  that  it  is  better,  within 
parexic  bounds,  to  let  such  parameters  tend  more  to  their  constant  values. 

This  strategy  is  further  motivated  by  the  initial  results  obtained  when  both 
the  state  and  Q  estimations  were  initiated  simultaneously.  Thus  initial  values 
of  .?  normally  should  be  chosen  as  nearly  representative  of  the  true  uncertain¬ 
ties  in  Q  as  possible.  An  interesting  aspect  of  the  Q  filter  performance  was 
noted  in  using  constant  values  of  the  residual  variance,  7’,  as  opposed  to 
estimating  them  via  (3.6. 36).  Results  indicated  that  variations  in  r  by  as 
much  as  three  orders  of  magnitude  produced  essentially  no  overall  change  in 
estimation  accuracy  from  that  obtained  In  the  foregoing  cases.  Generally, 
however,  it  is  recomnended  that  T  be  estimated,  thus  eliminating  additional 
a  pnlofu  guesswork. 

The  main  disadvantage  of  the  Q  filter  appears  to  be  the  increased 
demand  upon  computation  time.  This  can  be  severe,  particularly  for  more 
complex  models.  For  example,  the  two  models  treated  here  required  nearly 
50%  more  execution  time  than  that  without  the  sequential  Q  estimator.  This 
aspect  is  substantial  motivation  for  keeping  the  approximation  structures 
relatively  simple. 
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As  a  result  of  the  application  of  the  sequential  Q  estimator,  we 
have  shown  that  it  is  possible  to  improve  the  knowledge  and  confidence  in 
the  state  estimates.  It  is  most  important,  however,  that  the  Q  filter  opera¬ 
tion  be  Initiated  after  the  transient  period  of  the  state  filter  is  passed. 

5.7  Summary  and  Conclusions 

In  this  chapter  we  have  investigated  the  performance  of  model  error 
and  state  noise  covariance  estimation  algorithms  as  applied  to  the  orbit 
determination  problem  of  an  SEP  spacecraft  subject  to  anomalies  in  the  thrust 
program.  We  have  established  that  some  supplementary  measurement  type  is 
useful  in  addition  to  the  normal  radar  range  rate  observations.  The  use  of 
the  onboard  star-vehicle-Earth  angle,  while  providing  such  information,  suffers 
two  main  disadvantages.  First,  high  measurement  accuracies  are  necessary  to 
make  the  technique  useful.  Second,  as  the  distance  of  the  vehicle  to  the 
Earth  Increases,  the  measurement  angle  uncertainties  translate  into  larger 
position  uncertainties.  This  latter  problem  suggests  some  alternative  approach, 
such  as  switching  to  the  target  body  (in  this  case,  the  asteroid)  as  the 
approach  phase  is  entered.  This  would  provide  the  reverse  effect  with  greater 
accuracy  being  obtained  as  the  vehicle  approaches  the  target.  Another  alter¬ 
native  is  to  employ  a  different  measurement  type.  A  potential  candidate  is 
quasi,  very  long  baseline  interferometry  (QVLBI).  This  has  been  shown  to 
offer  greatly  Improved  observation  accuracies  over  that  obtained  with  con¬ 
ventional  range  rate  tracking  (52),  although  additional  development  is 
required  to  make  It  practical. 

The  use  of  a  state  noise  covariance  matrix  (Model  0)  to  maintain 
filter  operation  has  been  shown  to  be  effective  in  preventing  divergence  of 
the  estimates,  and  can  yield  accuracies  of  less  than  500  kni  position  error 
RSS.  In  general,  the  performance  can  be  Improved  using  any  of  the  other 
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models  investigated,  although  the  second  order  models  (2a  and  2b)  appear  to 
offer  greater  accuracy  and  stability  of  the  estimates.  In  particular,  the 
superior  performance  (RSS  *  70  km)  of  Model  2b,  being  the  closest  in  structure 
to  the  actual  error,  confirms  the  importance  of  accurate  model inq  to  the 
maximum  extent  practical. 

In  all  cases,  a  state  noise  covariance  matrix  is  required  in  order 
to  prevent  filter  saturation  and  subsequent  divergence.  However,  with  the 
estimation  of  the  model  errors,  the  state  noise,  and  therefore  the  error 
covariances,  are  smaller  than  without  model  error  estimation. 

Application  of  the  sequential  Q  estimator  has  been  found  to  be  a 
workable  approach  for  estimating  unknown  state  noise  covariances.  For  the 
error  approximation  Model  lb,  the  Q  filter  greatly  improved  the  RSS  error, 
decreasing  It  from  450  km  to  about  120  km.  The  state  RTC  in  both  cases  has 
been  shown  to  be  an  adequate  reflection  of  the  confidence  in  the  estimates, 
particularly  in  the  case  of  Model  2b,  where  the  RTC  practically  bounds  the 
RSS.  Of  paramount  importance  in  employing  the  state  and  i,1  filters  is  to 
insure  that  Q  filter  operation  is  not  initiated  until  after  the  transient 
period  of  state  filter  operation.  A  potential  problem  is  that  of  the  choice 
of  units  for  the  dual  state  and  i,1  filter  computations.  As  reflected  in 
Table  5.2,  very  small  numerical  values  can  be  encountered  for  the  error 
covariance  matrix.  s,  and  these  must  be  compatible  with  the  computational 
capabilities  of  the  particular  computer  to  be  employed.  Further,  making  the 
a  pnxon.i  s  too  large  can  result  in  estimates  following  the  observation 
residuals,  thus  at  least  delaying  accurate  convergence  of  the  estimates. 
Finally,  the  sequential  Q  estimator  suffers  the  disadvantages  of  requiring 
up  to  50%  more  computation  time  over  that  for  just  the  state  filter  with  a 
pfuo>U  constant  Q  values. 


In  sum,  we  have  shown  that  the  thrust  acceleration  errors  which, 
unchecked,  produce  extreme  divergence  of  the  state  estimates,  can  be  very 
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adequately  controlled.  The  techniques  employed  not  only  Increase  the 
navigation  accuracy,  but  also  knowledge  of  the  actual  dynamic  model. 


Chapter  6 


CONCLUSIONS  AND  RECOMMENDATIONS 
6.1  Summary  and  Conclusions 

This  dissertation  has  been  concerned  with  solving  the  fundamental 
problem  of  dynamic  modeling  errors  In  classical  Kalman  filtering.  The  very 
detrimental  effects  of  such  errors  have  been  Illustrated  In  both  algebraic 
and  numerical  terms  through  the  examples  of  the  rendezvous  and  SEP  vehicle 
estimation  problems.  In  approaching  the  model  error  problem,  various  aspects 
and  properties  of  the  filtering  elements  have  been  presented,  e.g..  Infor¬ 
mation,  observability,  etc.,  thus  providing  a  fundamental  base  for  the 
investigation.  A  number  of  non-adaptive  algorithms  have  been  presented 
which  attempt  to  account  for  errors  in  the  dynamic  model.  These  range  from 
the  optimal  approaches  of  employing  a  state  noise  covariance  matrix  and  the 
more  complex  limited  memory  filter,  to  the  suboptimal  techniques  of  age¬ 
weighting,  additive  gain  term,  and  gain  scaling  modifications.  In  all  cases, 
these  approaches  compensate  for  the  model  error  by  simply  keeping  the  gain 
at  a  high  enough  value  to  prevent  saturation,  and  thus  allow  continued  filter 
operation.  There  is,  however,  a  certain  minimum  estimation  accuracy  obtainable, 
reflected  In  part  by  the  steady  state  value  of  the  error  covariance,  and 
determined  by  the  corresponding  filter  parameters. 

Except  for  the  limited  memory  filter,  adaptive  forms  of  each  of  these 
error  compensation  techniques  have  been  presented.  Using  the  adaptive  state 
noise  covariance  estimation  algorithm  due  to  Jazwlnski,  adaptive  forms  were 
derived  for  each  of  the  other  methods. 
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The  powerful  technique  of  adaptively  estimating  modeling  errors  has 
been  presented,  along  with  discussions  of  the  assumed  functional  form  or 
mathematical  structure  of  these  errors.  A  unifying  derivation  has  been 
presented  resulting  in  a  relatively  flexible  structure  suggested  as  a  can¬ 
didate  function  for  use  in  model  error  compensation.  Additionally,  a  new 
algorithm  for  sequentially  estimating  state  noise  covariances  has  been 
developed,  based  upon  the  ordinary  Kalman  filtering  algorithm.  In  view  of 
the  non-negative  definite  property  of  the  state  noise  covariance,  and  certain 
other  assumptions,  the  algorithm  produces  constrained  minimum  variance 
estimates. 

In  applying  the  techniques  to  the  rendezvous  and  SEP  estimation 
problems,  numerical  results  have  offered  some  interesting  conclusions.  From 
the  rendezvous  problem,  the  performance  of  the  ordinary  gain  scaling  algo¬ 
rithm  (b  -  constant)  clearly  indicates  that  this  method  is  unacceptable  to 
counter  the  effect  of  unbounded  model  errors.  This  is  true  for  both  the 
adaptive  and  non-adaptive  forms.  In  their  non-adaptive  forms,  the  other 
algorithms  have  been  shown  to  produce  equivalent  steady  state  performance 
when  appropriate  filter  parameter  values  are  employed.  This  is  further 
substantiated  by  the  adaptive  formulations  for  which  the  performances  are 
nearly  the  same.  In  fact,  Jazwinski's  state  noise  covariance  estimation  and 
the  adaptive  age-weighting  algorithms  produce  identical  performance.  Further, 
the  modified  gain  scaling  and  additive  gain  term  methods  in  their  adaptive 
forms  also  yield  identical  performance.  Between  the  two  sets,  the  former 
algorithms  appear  to  offer  slightly  greater  confidence  in  their  estimates. 

The  reason  for  this  is  not  clear.  However,  one  clue  is  that  the  former  has 
filter  parameters  ( Q  and  e)  which  can  take  on  infinitely  large  values;  the 
latter  algorithms  have  parameters  (ot  and  B)  ranging  between  zero  and  one. 
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In  view  of  this,  the  conclusion  suggested  Is  that  the  differences  may  be  due 
to  numerical  sensitivities,  a  condition  brought  on  by  finite  computer  word 
length.  Thus,  In  the  latter  case,  considerably  more  significant  figures 
are  required  to  achieve  the  same  results  as  for  Q  and  s. 

Results  obtained  for  the  estimation  of  model  errors,  as  applied  to 
the  rendezvous  problem,  clearly  indicate  the  superior  performance  of  this 
method.  This  is  to  be  expected,  since  It  is  the  only  method  which  actually 
attempts  to  Improve  knowledge  of  the  dynamic  model.  Coupled  with  an  appro¬ 
priately  determined  state  noise  covariance  matrix,  this  technique  is  thus 
the  best  approach  to  the  model  error  problem.  For  this  reason,  this  approach 
was  taken  for  Investigating  the  solar  electric  propulsion  mission.  The 
accuracy  of  this  approach  In  the  SEP  orbit  determination  study  is  demonstrated 
by  the  performance  for  the  various  assumed  model  structures.  In  general, 
higher  order  models  are  to  be  preferred  since  their  structure  tends  to  make 
them  more  adaptable  to  complex,  as  well  as  simple,  error  forms.  This  is 
substantiated  by  the  more  accurate  estimation  performance  of  the  second  order 
models.  In  view  of  the  structure  of  the  SEP  acceleration  error,  and  the 
rather  remarkable  performance  of  Model  2b,  the  importance  of  accurate  but 
parexic  modeling  cannot  be  under-emphasized. 

The  sequential  estimator  for  the  state  noise  covariance  matrix  can 
be  effective  in  maintaining  operation  of  the  Kalman  filter,  both  with  and 
without  the  formulation  for  estimating  model  errors,  although  it  is  rather 
demanding  of  computation  time.  For  the  rendezvous  problem,  the  estimates 
of  the  state  noise  covariance  were  found  to  be  somewhat  erratic,  indicating 
a  rather  acute  sensitivity  to  the  residuals.  In  the  SEP  mission  simulation, 
the  strategy  of  avoiding  Q  filter  operation  during  the  state  filter  transient 
periods  proved  to  be  the  key  to  success  in  this  application.  Thus,  performance 
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In  the  rendezvous  problem  could  probably  be  Improved  using  this  philosophy. 
Further,  In  the  rendezvous  application,  the  T  estimation  utilized  all  a 
pohtwLonl'E  residuals,  thus  including  the  effects  of  transient  state  filter 
operation.  For  the  SEP  mission,  the  modification  of  the  r  estimator  limiting 
It  to  a  batch  of  20  residuals  appeared  to  be  a  more  viable  approach.  In  this 
configuration  the  Q  filter  operating  in  the  state  filter,  post- transient 
period  was  found  to  be  quite  effective  In  improving  state  noise  covariance 
values  for  Models  lb  and  2b.  The  success  obtained  with  this  strateny  suggests 
that  the  dual  state-G  filter  algorithm  can  be  a  useful  estimation  tool. 

6.2  Recommendations  for  Further  Investigation 

One  of  the  most  apparent  questions  which  remains  unanswered  is  that 
of  performance  of  the  suboptimal  adaptive  and  non-adaptive  algorithms  for  the 
case  of  vector  state  variables.  Some  reflection  will  reveal  that  the  algo¬ 
rithms  could  not  be  expected  to  be  equlvalenced  in  the  same  sense  as  for  the 
scalar  case,  v-tz. ,  the  rendezvous  problem.  For  example,  the  age-weighting 
algorithm  simply  multiplies  the  error  covariance  by  a  scalar,  thus  scaling 
each  element  by  the  same  amount.  The  additive  gain  term,  however,  adds 
different  terms  of  varying  value  to  the  error  covariance. 

The  additive  gain  term  algorithm  suffers  a  disadvantage  in  that  the 
added  term, 

3  R  HT/H  HT 

affects  the  gain  and  error  covariance  only  for  those  state  elements  which 
appear  explicitly  In  the  observation  state  relationship.  As  indicated  in 
section  2.2,  the  gain  scaling  modification  using 


b  =  i+±JL- 
H  P  HT 
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produces  the  gain  expression  of  (2.2.15)  which  differs  from  that  for  the 
additive  gain  (2.3.2)  by  the  presence  of  the  error  covariance,  T.  As  long 
as  P  Is  of  full  rank,  the  former  gain, 

k  =  (p  ht  +  8  rp  ht/h  p  ht)(h  p  nT  +  io'x  no 

Is  altered  for  alt  terms  In  the  state  vector,  regardless  of  their  explicit 
appearance  or  absence  In  the  measurement  equation.  Thus  the  original  additive 
gain  term  cannot  be  successfully  applied  to,  say,  the  model  error  estimation 
algorithm;  the  modified  gain  (2.2.15)  can.  In  light  of  the  fact  that  for  the 
scalar  problem  the  P  divides  out,  this  aspect  was  not  investigated  in  the 
rendezvous  problem.  However,  any  vector  problem  (say,  the  rendezvous  problem 
where  range,  p,  is  Included  with  range  rate,  p,  to  form  a  2  vector)  could  be 
used  to  Investigate  this  aspect. 

Similar  conditions  occur  for  Jazwinski's  Q  estimation  algorithm.  Here 
the  pseudo  inverse, 

a*  =  JL  =  rT  ht 

A  Ar  H  T  VT  HT 

usually  suffers  the  same  malady  as  the  additive  gain  term.  By  employing  a 
different  pseudo  inverse,  say, 

0#,-EiL 

AP  Ar 

state  elements  not  In  the  measurement  equation  will  still  have  corresponding 
non-zero  state  noise  covariance  elements. 

In  this  study,  the  common  structure  given  by  the  function  of  equation 
(3.4.6)  has  only  been  suggested  for  use  in  estimating  model  errors.  It  should 
be  particularly  Interesting  to  Investigate  the  ability  of  this  common  structure 
to  adapt  to  different  model  error  functions,  both  bounded  and  unbounded. 
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Considerable  study  of  the  sequential  state  noise  estimator  remains 
to  be  done,  particularly  with  regard  to  sensitivity  analyses.  Some  sensi¬ 
tivities  have  been  examined  with  regard  to  various  a  puloKi  values  of  s  and 
T.  However,  additional  study  Is  warranted. 

Finally,  we  reiterate  that  some  of  the  algorithms  may  be  combined  to 
obtain  hybrid  techniques.  The  approaches  here  are  many.  One  possibility 
Is  to  apply  the  suggested  modified  gain  scaling  technique  (section  2.2, 
with  the  gain  given  by  (2.2.15))  to  the  model  error  estimation  technique. 

This  approach,  while  suboptlmal,  would  require  only  the  selection  of  the 
filter  parameter,  6,  rather  than  three  or  more  state  noise  covariance  elements. 
Further,  the  adaptive  form  for  estimating  3  could  be  employed,  thereby  making 
the  algorithm  completely  adaptive. 

Our  concern  In  this  study  has  been  to  investigate  concepts  and 
techniques  for  alleviating  the  dynamic  model  error  problem  in  linear  filtering. 
Many  approaches  have  been  discussed  and  analyzed,  and  advantages  and  dis¬ 
advantages  presented.  The  filtering  and  estimation  process  —  even  in  its  most 
sophisticated  forms  —  can  never  be  any  better  than  the  mathematical  model 
representing  the  dynamic  process.  In  the  final  analysis,  then,  there  can  be 
no  substitute  for  accurate  modeling,  whether  analytically  determined  or 
adaptively  estimated. 
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MATRIX  INVERSION  LEMMA  (SCHUR  IDENTITY) 


Define  the  positive  definite  matrix,  x ,  as 
x  =  (at  b  a  +  cf1 


( i.i ) 


Taken  the  Inverse  and  pre-multlply  by  x  to  get 


T=XABA+XC 


Y/1.  r  > 


Post-multlply  by  c"1: 


c'1  =  X  AT  B  A  c'1  +  X 


(A.V 


X  =  c"1  -  X  AT  B  A  c"1 


Post-multlply  (A. 3)  by  AT  B  to  obtain  the  following  sequence: 
C'1  AT  B  =  X  AT  B  A  c'1  AT  B  +  X  AT  B 
*  X  at  B(A  c_I  at  B  +  I) 

=  X  AT  B(A  C ~1  Ar  +  B ~1)  B 
Post-multlply  by  b"j: 

C1  AT  =  X  AT  B(A  C1  AT  +  B'1) 


X  AT  B  =  c’1  AT  (A  c"1  AT  +  B'1)’1 


Substitute  (A. 4)  Into  (A.S)  and  solve  for  x ,  thus  obtaining  an  expression 


equivalent  to  (A A) 

X  =  c'1  -  c'1  at  (a  c~z  at  +  bT1)  a  c "j 


(A  A) 


N 
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A  AND  B  MATRIX  ELEMENTS 


AJ fotrlx:  Partition  the  A  matrix  Into  sub-matrices  as  follows 

"o  I  0 

A  ■  A21  0  A23 

0  0  A33 

m  m 

These  elements  are  given  as 

1  =  3x3  identity  matrix 
A*1  -  S  i  -  4,  5,  6;  j  =  1,  2t  3 

^2  3  *  la^]  ;  i  =  4t  5 ■,  6;  j  =  7,  8,  9 

i  i,  Q  =  7,  8,  9 

where 
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a 4  7  *  “Y/\p\  ,  asy  =  X/\p\  ,  a._.  =  0  for  other  i, 

e  Is  the  estimated  thrust  acceleration  error,  and  |p|  =  (X2  +  y2/* 
Model  la:  aij  ~  0  »  =  ?,  8,  9 

Model  lb:  am  =  -a.  ,  ayn  =  -e  ,  -  0  for  other  itj 

Model  2a:  £27  8  *  i  t  “  0  for  other  itj 

Model  2b:  ayt  -  1  »  ati  *  -0  ,  £289  *  -e  t  £2^  =  0  for  other  itj 

For  Model  0,  e  =  0  In  a2i,  and  A23  =  Ai3  -  0. 

B  Matrix:  The  B  matrix  Is  defined  in  partitioned  form  as 


where 

'X/\p\  -Y/\p\  O' 

M  -  Y/\p\  X/\p\  0 

.0  0  1  . 

which  Is  the  transformation  matrix  from  the  orbital  frame  to  the  heliocentric 
frame. 
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Appendix  C 


OBSERVATION  -  STATE  RELATION  PARTIAL  DERIVATIVES  (H  MATRIX) 


Range  Rate  Measurements,  p: 


[«i  -  xsl)  -  <xi 


x.)  Cp/pj]  /p  , 

SI 


ul  -  xsil  /P 


i  *  1,  2,  3 


where  /  and  /  are  the  heliocentric  position  and  velocity  components  of  the 
vehicle;  and  X  .  and  X  .  are  the  heliocentric  position  and  velocity  components 

SI  SI 

of  the  tracking  station.  The  derivatives  with  respect  to  the  other  state 
elements  (depending  upon  the  approximating  model)  are  zero. 


Star- Vehicle-Earth  Angle,  £: 


11. 

dx. 


8X.  -  008  5 


<X8i  ~  V 

\Re  -  r| 


/(\ R  -  r|2  sin  V*  l  =  1*  2j  3 


where  the  Xe±  are  the  heliocentric  position  components  of  the  Earth;  the  oxi 
are  the  navigation  star  unit  vector  heliocentric  components;  R  and  r  are 
the  Earth  and  vehicle  heliocentric  position  vectors.  The  rest  of  the  partial 
derivatives  are  zero. 


Appendix  D 


THE  ERGODIC  PROCESS 

In  a  process  which  is  ergodic  in  the  mean,  theory  prodirts  68.27£ 
for  normally  distributed  errors  (46,  98).  If  the  process  is  onjntlic  in  the 
mean,  the  time  sample  averages  are  representative  of  the  ensemble  sample 
averages,  that  Is  averaging  more  and  more  values  along  the  time  axis  results 
In  convergence  to  the  ensemble  mean.  A  necessary  and  sufficient  condition 
that  a  process,  x(t) ,  0  <_  t  <  «,  be  ergodic  in  the  mean  is  that  it  be  wi de¬ 
sense  or  weakly  stationary,  and  that 
T 

Urn  J7  f  E{[x(t)  -  y]2}  d  t  -  0 
T-* «  1  JQ 

or  In  the  discrete  case, 

1  N-J 

Um  77  5Z  £([*  -  y]2}  =  0 
N-*°  N  i=l 

See,  for  example,  (46,  166-167).  The  assumption  of  ergodlcity  in  the  mean  for 
the  range  rate  error  process,  p (t),  is  reasonable  as  long  as  the  process  is 
stationary  and  the  time  average  converges  to  E{p}  =  0,  a  condition  guaranteed 
if  the  limit  equations  above  are  fulfilled.  In  our  investigation,  this  is  not 
strictly  the  case;  however,  for  short  time  periods  it  is  approximately  true. 
Further,  for  a  given  mean  and  covariance,  the  Gaussian  distribution  represents 
the  maximum  uncertainty,  entropy,  (47,  613).  Thus  the  comparison  of  the  time 
sample  with  theoretical  normal  (Gaussian)  distribution  limits  is  reasonable 
as  long  as  the  ergodlcity  assumption  is  justifiable. 
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